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FROM THE PREFACE TO THE 
FIRST ENGLISH EDITION 


Tue present book is one of the series on Theoretical Physics, in which we 
endeavour to give an up-to-date account of various departments of that science. 
The complete series will contain the following nine volumes: 

1. Mechanics. 2. The classical theory of fields. 3. Quantum mechanics 
(non-relativistic theory). 4. Relativistic quantum theory. 5. Statistical physics. 
6. Fluid mechanics: 7. Theory of elasticity. 8. Electrodynamics of continuous 
media. 9. Physical kinetics. 

Of these, volumes 4 and 9 remain to be written. 

The scope of modern theoretical physics is very wide, and we have, of 
course, made no attempt to discuss in these books all that is now included in 
the subject. One of the principles which guided our choice of material was 
not to deal with those topics which could not properly be expounded without 
at the same time giving a detailed account of the existing experimental results. 
For this reason the greater part of nuclear physics, for example, lies outside the 
scope of these books. Another principle of selection was not to discuss very 
complicated applications of the theory. Both these criteria are, of course, 
to some extent subjective. 

We have tried to deal as fully as possible with those topics that are included. 
For this reason we do not, as a rule, give references to the original papers, 
but simply name their authors. We give bibliographical references only to 
work which contains matters not fully expounded by us, which by their com- 
plexity lie “on the borderline” as regards selection or rejection. We have 
tried also to indicate sources of material which might be of use for reference. 
Even with these limitations, however, the bibliography given makes no pre- 
tence of being exhaustive. 

We attempt to discuss general topics in such a way that the physical signifi- 
cance of the theory is exhibited as clearly as possible, and then to build up the 
mathematical formalism. In doing so, we do not aim at “mathematical 
rigour” of exposition, which in theoretical physics often amounts to self- 
deception. 

The present volume is devoted to non-relativistic quantum mechanics. By 
“relativistic theory” we here mean, in the widest sense, the theory of all 
quantum phenomena which significantly depend on the velocity of light. The 
volume on this subject (volume 4) will therefore contain not only Dirac’s 
relativistic theory and what is now known as quantum electrodynamics, but 
also the whole of the quantura theory of radiation. 


Institute of Physical Problems L. D. LANDAU 
USSR Academy of Sciences E. M. Lirsuitz 


August 1956 
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PREFACE TO THE 
SECOND ENGLISH EDITION 


For this second edition the book has been considerably revised and en- 
larged, but the general plan and style remain as before. Every chapter has 
been revised. In particular, extensive changes have been made in the sections 
dealing with the theory of the addition of angular momenta and with collision 
theory. A new chapter on nuclear structure has been added; in accordance 
with the general plan of the course, the subjects in question are discussed only 
to the extent that is proper without an accompanying detailed analysis of the 
experimental results. 

We should like to express our thanks to all our many colleagues whose 
comments have been utilized in the revision of the book. Numerous com- 
ments were received from V. L. Ginzburg and Ya. A. Smorodinskii. We are 
especially grateful to L. P. Pitaevskii for the great help which he has given in 
checking the formulae and the problems. 

Our sincere thanks are due to Dr. Sykes and Dr. Bell, who not only 
translated excellently both the first and the second edition of the book, but 
also made a number of useful comments and assisted in the detection of 
various misprints in the first edition. 

Finally, we are grateful to the Pergamon Press, which always acceded to 
our requests during the production of the book. 


L. D. LANDAU 
October 1964 E. M. LIFsHITZ 


PREFACE TO THE 
THIRD RUSSIAN EDITION 


THE previous edition of this volume was the last book on which I worked 
together with my teacher L. D. Landau. The revision and expansion that 
we then carried out was very considerable, and affected every chapter. 

For the third edition, naturally, much less revision was needed. Never- 
theless, a fair amount of new material has been added, including some more 
problems, and relating both to recent research and to earlier results that 
have now become of greater significance. 

Landau’s astonishing grasp of theoretical physics often enabled him to 
dispense with any consultation of original papers: he was able to derive 
results by methods of his own choice. This may have been the reason why 
our book did not contain certain necessary references to other authors. In 
the present edition, I have tried to supply them as far as possible. I have 
also added references to the work of Landau himself where we describe 
results or methods that are due to him personally and have not been 
published elsewhere. 

As when dealing with the revision of other volumes in the Course of 
Theoretical Physics, I have had the assistance of numerous colleagues who 
informed me either of deficiencies in the treatment given previously, or of 
new material that should be added. Many useful suggestions incorporated 
in this book have come from A. M. Brodsku, G. F. Drukarev, I. G. Kaplan, 
V. P. Krainov, I. B. Levinson, P. E. Nemirovsku, V. L. Pokrovski, 
I. I. Sobel man, and I. S. Shapiro. Myv sincere thanks are due to all of 
these. 

The whole of the work on revising this volume has been done in close 
collaboration with L. P. Pitaevskii. In him I have had the good fortune 
to find a fellow-worker who has passed likewise through the school of 
Landau and is inspired by the same ideals in the service of science. 


Moscow E. M. LirsHitz 
November 1973 
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EDITOR'S PREFAGE TO THE 
FOURTH RUSSIAN EDITION 


In this edition of Quantum Mechanics some misprints and errors noted 
since the publication of the third edition have been corrected. Some 
small improvements have also been made, and several problems have 
been added. 

I am grateful to all readers who have provided me with comments. 


May 1988 L. P. PITAEVskii 
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NOTATION 


a 


Operators are denoted by a circumflex: f 
dl” volume element in coordinate space 
dg element in configuration space 

d3p element in momentum space 

fnm = f; = <n| f |m) matrix elements of the quantity f (see definition in §11) 
wnm = (En— Em)/Î transition frequency 

{f, 2} = fÉ — êf commutator of two operators 

Ĥ Hamiltonian 

ô, phase shifts of wave functions 

Atomic and Coulomb units (see beginning of §36) 

Vector and tensor indices are denoted by Latin letters 7, k, I 


eip} antisymmetric unit tensor (see §26) 


References to other volumes in the Course of Theoretical Physics: 

Mechanics = Vol. 1 (Mechanics, third English edition, 1976). 

Fields = Vol. 2 (The Classical Theory of Fields, fourth English edition, 
19757: 

ROT or Relativistic Quantum Theory = Vol. 4 (Relativistic Quantum Theory, 
first English edition, Part 1, 1971; Part 2, 1974); the second English edition 
appeared in one volume as Quantum Electrodynamics, 1982. 

All are published by Pergamon Press. 
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CHAPTER I 


THE BASIC: CONCEPTS OF 
QUANTUM MECHANICS 


§1. The uncertainty principle 


WHEN we attempt to apply classical mechanics and electrodynamics to explain 
atomic phenomena, they lead to results which are in obvious conflict with 
experiment. This is very clearly seen from the contradiction obtained on 
applying ordinary electrodynamics to a model of an atom in which the elec- 
trons move round the nucleus in classical orbits. During such motion, as in 
any accelerated motion of charges, the electrons would have to emit electro- 
magnetic waves continually. By this emission, the electrons would lose their 
energy, and this would eventually cause them to fall into the nucleus. Thus, 
according to classical electrodynamics, the atom would be unstable, which 
does not at all agree with reality. 

This marked contradiction between theory and experiment indicates that 
the construction of a theory applicable to atomic phenomena—that is, pheno- 
mena œ curring in particles of very small mass at very small distances— 
demands a fundamental modification of the basic physical concepts and laws. 

As a starting-point for an investigation of these modifications, it is conveni- 
ent to take the experimentally observed phenomenon known as electron 
diffraction.t It is found that, when a homogeneous beam of electrons passes 
through a crystal, the emergent beam exhibits a pattern of alternate maxima 
and minima of intensity, wholly similar to the diffraction pattern observed 
in the diffraction of electromagnetic waves. Thus, under certain conditions, 
the behaviour of material particles—in this case, the electrons—displays 
features belonging to wave processes. 

How markedly this phenomenon contradicts the usual ideas of motion is 
best seen from the following imaginary experiment, an idealization of the 
experiment of electron diffraction by a crystal. Let us imagine a screen 
impermeable to electrons, in which two slits are cut. On observing the 
passage of a beam of electrons] through one of the slits, the other being 
covered, we obtain, on a continuous screen placed behind the slit, some pat- 
tern of intensity distribution; in the same way, by uncovering the second 


i ee ee 


t The phenomenon ot electron diffraction was in fact discovered after quantum mechanics 
was invented. In our discussion, however, we shall not adhere to the historical sequence of 
development of the theory, but shall endeavour to construct it in such a way that the con- 
nection between the basic principles of quantum mechanics and the expertmentally observed 
phenomena 1s most clearly shown 

t The beam ts supposed so rarefied that the interaction of the particles in it plays no part. 
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slit and covering the first, we obtain another pattern. On observing the 
passage of the beam through both slits, we should expect, on the basis of 
ordinary classical ideas, a pattern which is a simple superposition of the other 
two: each electron, moving in its path, passes through one of the slits and 
has no effect on the electrons passing through the other slit. The phenomenon 
of electron diffraction shows, however, that in reality we obtain a diffraction 
pattern which, owing to interference, does not at all correspond to the sum 
of the patterns given by each slit separately. It is clear that this result can 
in no way be reconciled with the idea that electrons move in paths. 

Thus the mechanics which governs atomic phenomena—quantum mechanics 
or wave mechanics—-must be based on ideas of motion which are fundamentally 
different from those of classical mechanics. In quantum mechanics there is 
no such concept as the path of a particle. This forms the content of what is 
called the uncertainty principle, one of the fundamental principles of quantum 
mechanics, discovered by W. Heisenberg in 1927.+ 

In that it rejects the ordinary ideas of classical mechanics, the uncertainty 
principle might be said to be negative in content. Of course, this principle 
in itself does not suffice as a basis on which to construct a new mechanics of 
particles. Such a theory must naturally be founded on some positive asser- 
tions, which we shall discuss below (§2). However, in order to formulate 
these assertions, we must first ascertain the statement of the problems which 
confront quantum mechanics. To do so, we first examine the special nature 
of the interrelation between quantum mechanics and classical mechanics. A 
more general theory can usually be formulated in a logically complete manner, 
independently of a less general theory which forms a limiting case of it. Thus, 
relativistic mechanics can be constructed on the basis of its owr fundamental 
principles, without any reference to Newtonian mechanics. Itis in principle 
impossible, however, to formulate the basic concepts of quantum mechanics 
without using classical mechanics. The fact that an electron] has no definite 
path means that it has also, in itself, no other dynamical characteristics. || 
Hence it is clear that, for a system composed only of quantum objects, 
it would be entirely impossible to construct any logically independent 
mechanics. The possibility of a quantitative description of the motion of an 
electron requires the presence also of physical objects which obey classical 
mechanics to a sufficient degree of accuracy. If an electron interacts with 
such a ‘‘classical object”, the state of the latter is, generally speaking, altered. 
The nature and magnitude of this change depend on the state of the electron, 
and therefore may serve to characterize it quantitatively. 

In this connection the ‘‘classical object” is usually called apparatus, and 


+ It is of interest to note that the complete mathematical formalism of quantum mechanics 
was constructed by W. Heisenberg and E. Schrödinger in 1925-6, before the discovery of 
the uncertainty principle, which revealed the physical content of this formalism. 

1 In this and the following sections we shall, for brevity, speak of tan electron”, meaning 
in general any object of a quantum nature, i.e. a particle or system of particles obeying 
quantum mechanics and not classical mechanics 

| We refer to quantities which characterize the motion of the electron, and not to those, 
such as the charge and the mass, which relate to it as a particle; these are parameters. 
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its interaction with the electron is spoken of as measurement. However, it 
must be emphasized that we are here not discussing a process of measurement 
in which the physicist-observer takes part. By measurement, in quantum 
mechanics, we understand any process of interaction between classical and 
quantum objects, occurring apart from and independently of any observer. 
The importance of the concept of measurement in quantum mechanics was 
elucidated by N. Bohr. 

We have defined “‘apparatus”’ as a physical object which is governed, with 
sufficient accuracy, by classical mechanics. Such, for instance, is a body 
of large enough mass. However, it must not be supposed that apparatus is 
necessarily macroscopic. Under certain conditions, the part of apparatus may 
also be taken by an object which is microscopic, since the idea of ‘“‘with 
sufficient accuracy” depends on the actual problem proposed. Thus, the 
motion of an electron in a Wilson chamber is observed by means of the 
cloudy track which it leaves, and the thickness of this is large-compared with 
atomic dimensions; when the path is determined with such low accuracy, 
the electron is an entirely classical object. 

Thus quantum mechanics occupies a very unusual place among physical 
theories: it contains classical mechanics as a limiting case, yet at the same 
time it requires this limiting case for its own formulation. 

We may now formulate the problem of quantum mechanics. A typical 
problem consists in predicting the result of a subsequent measurement from 
the known results of previous measurements. Moreover, we shall see later 
that, in comparison with classical mechanics, quantum mecnanics, generally 
speaking, restricts the range of values which can be taken by various physical 
quantities (for example, energy): that is, the values which can be obtained 
as a result of measuring the quantity concerned. The methods of quantum 
mechanics must enable us to determine these admissible values. 

The measuring process has in quantum mechanics a very important pro- 
perty: it always affects the electron subjected to it, and it is in principle 
impossible to make its effect arbitrarily small, for a given accuracy of measure- 
ment. The more exact the measurement, the stronger the effect exerted by 
it, and only in measurements of very low accuracy can the effect on the mea- 
sured object be small. This property of measurements is logically related 
to the fact that the dynamical characteristics of the electron appear only as a 
result of the measurement itself. It is clear that, if the effect of the measuring 
process on the object of it could be made arbitrarily small, this would mean 
that the measured quantity has in itself a definite value independent of the 
measurement. 

Among the various kinds of measurement, the measurement of the co- 
ordinates of the electron plays a fundamental part. Within the limits of 
applicability of quantum mechanics, a measurement of the coordinates of an 
electron can always be performed} with any desired accuracy. 


a r a e 


+ Once again we emphasize that, in speaking of ‘performing a measurement”, we refer to 
the interaction of an electron with a classical ‘apparatus’, which in no way presupposes 
the presence of an external observer. 
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Let us suppose that, at definite time intervals Az, successive measurements of 
the coordinates of an electron are made. The results will not in general lie 
on a smooth curve. On the contrary, the more accurately the measurements 
are made, the more discontinuous and disorderly will be the variation of 
their results, in accordance with the non-existence of a path of the electron. 
A fairly smooth path is obtained only if the coordinates of the electron are 
measured with a low degree of accuracy, as for instance from the condensa- 
tion of vapour droplets in a Wilson chamber. 

If now, leaving the accuracy of the measurements unchanged, we diminish 
the intervals At between measurements, then adjacent measurements, of 
course, give neighbouring values of the coordinates. However, the results 
of a series of successive measurements, though they lie in a small region of 
space, will be distributed in this region in a wholly irregular manner, lying on 
no smooth curve. In particular, as At tends to zero, the results of adjacent 
measurements by no means tend to lie on one straight line. 

This circumstance shows that, in quantum mechanics, there is no such 
concept as the velocity of a particle in the classical sense of the word, i.e. the 
limit to which the difference of the coordinates at two instants, divided by 
the interval At between these instants, tends as At tends to zero. However, 
we shall see later that in quantum mechanics, nevertheless, a reasonable 
definition of the velocity of a particle at a given instant can be constructed, 
and this velocity passes into the classical velocity as we pass to classical mech- 
anics. But whereas in classical mechanics a particle has definite coordinates 
and velocity at any given instant, in quantum mechanics the situation is 
entirely different. If, as aresult of measurement, the electron is found to have 
definite co. dinates, then it has no definite velocity whatever. Conversely, 
if the electron has a definite velocity, it cannot have a definite position in 
space. For the simultaneous existence of the coordinates and velocity would 
mean the existence of a definite path, which the electron has not. Thus, in 
quantum mechanics, the coordinates and velocity of an electron are quantities 
which cannot be simultaneously measured exactly, i.e. they cannot simultane- 
ously have definite values. We may sav that the coordinates and velocity 
of the electron are quantities which do not exist simultaneously. In what 
follows we shall derive the quantitative relation which determines the pos- 
sibility of an inexact measurement of the coordinates and velocity at the same 
instant. 

A complete description of the state of a physical system in classical mech- 
anics 1s effected by stating all its coordinates and velocities at a given instant; 
with these initial data, the equations of motion completely determine the 
behaviour of the system at all subsequent instants. In quantum mechanics 
such a description is in principle impossible, since the coordinates and the 
corresponding velocities cannot exist simultaneously. Thus a description 
of the state of a quantum system is effected by means of a smaller number of 
quantities than in classical mechanics, i.e. it is less detailed than a classical 
description. 

A very important consequence follows from this regarding the nature of the 
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predictions made in quantum mechanics. Whereas a classical description 
suffices to predict the future motion of a mechanical system with complete 
accuracy, the less detailed description given in quantum mechanics evidently 
cannot be enough to do this. This means that, even if an electron is in a state 
described in the most complete manner possible in quantum mechanics, its 
behaviour at subsequent instants is still in principle uncertain. Hence quan- 
tum mechanics cannot make completely definite predictions concerning the 
future behaviour of the electron. For a given initial state of the electron, a 
subsequent measurement can give various results. The problem in 
quantum mechanics consists in determining the probability of obtaining vari- 
ous results on performing this measurement. It is understood, of course, 
that in some cases the probability of a given result of measurement may be 
equal to unity, i.e. certainty, so that the result of that measurement is unique. 

All measuring processes in quantum mechanics may be divided into two 
classes. In one, which contains the majority of measurements, we find those 
which do not, in any state of the system, lead with certainty to a unique 
result. The other class contains measurements such that for every possible 
result of measurement there is a state in which the measurement leads with 
certainty to that result. These latter measurements, which may be called 
predictable, play an important part in quantum mechanics. The quantitative 
characteristics of a state which are determined by such measurements are 
what are called physical quantities in quantum mechanics. If in some state 
a measurement gives with certainty a unique result, we shall say that in this 
state the corresponding physical quantity has a definite value. In future we 
shall always understand the expression “‘physical quantity” in the sense given 
here. 

We shall often find in what follows that by no means every set of physical 
quantities in quantum mechanics can be measured simultaneously, i.e. can 
all have definite values at the same time. We have already mentioned one 
example, namely the velocity and coordinates of an electron. An important 
part is played in quantum mechanics by sets of physical quantities having 
the following property: these quantities can be measured simultaneously, 
but if they simultaneously have definite values, no other physical quantity 
(not being a function of these) can have a definite value in that state. We 
shall speak of such sets of physical quantities as complete sets. 

Any description of the state of an electron arises as a result of some mea- 
surement. We shall now formulate the meaning of a complete description of 
a state in quantum mechanics. Completely described states occur as a result 
of the simultaneous measurement of a complete set of physical quanti- 
ties. From the results of such a measurement we can, in particular, deter- 
mine the probability of various results of any subsequent measurement, 
regardless of the history of the electron prior to the first measurement. 


From now on (except in §14) we shall understand by the states of a quan- 
tum system just these completely described states. 
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§2. The principle of superposition 


The radical change in the physical concepts of motion in quantum 
mechanics as compared with classical mechanics demands, of course, an 
equally radical change in the mathematical formalism of the theory. We 
must therefore consider first of all the way in which states are described in 
quantum mechanics. 

We shall denote by g the set of coordinates of a quantum system, and by dg 
the product of the differentials of these coordinates. This dg is called an 
element of volume in the configuration space of the system; for one particle, 
dg coincides with an element of volume dV’ in ordinary space. 

The basis of the mathematical formalism of quantum mechanics lies in the 
proposition that the state of a system can be described by a definite (in 
general complex) function ¥(q) of the coordinates. The square of the 
modulus of this function determines the probability distribution of the values 
of the coordinates: 
on the system will find the values of the coordinates to be in the element dg 
of configuration space. The function ¥ is called the wave function of the 
system.T 

A knowledge of the wave function allows us, in principle, to calculate the 
probability of the various results of any measurement (not necessarily of the 
coordinates) also. All these probabilities are determined by expressions 
bilinear in Y and ¥*. The most general form of such an expression is 


|| EO Ea’) la, g’) dgdg', (2.1) 





where the function ¢(g¢, g’) depends on the nature and the result of the mea- 
surement, and the integration is extended over all configuration space. The 
probability Y°Y"* of various values of the coordinates is itself an expression 
of this type.] 

The state of the system, and with it the wave function, in general varies 
with time. In this sense the wave function can be regarded as a function of 
time also. If the wave function is known at some initial instant, then, from 
the very meaning of the concept of complete description of a state, it is in 
principle determined at every succeeding instant. The actual dependence 
of the wave function on time is determined by equations which will be de- 
rived later. 

The sum of the probabilities of all possible values of the coordinates of 
the system must, by definition, be equal to unity. It is therefore necessary 
that the result of integrating ||? over all configuration space should be equal 
to unity: 


[ t¥ltdg =1. (2.2) 


t It was first introduced into quantum mechanics by Schrödinger in 1926. 

J It is obtained from (2.1) when ¢(g, 9’) = lg —go) ôlg'—qgo), where ô denotes the delta 
function, defined in §5 below; g, denotes the value of the coordinates whose probability 
is required. 
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This equation is what is called the normalisation condition for wave functions. 
If the integral of |¥ i? converges, then by choosing an appropriate constant 
coefficient the function ¥ can always be, as we say, normalized. However, we 
shall see later that the integral of |F|? may diverge, and then cannot be 
normalized by the condition (2.2). In such cases |'F|* does not, of course, 
determine the absolute values of the probability of the coordinates, but the 
ratio of the values of |F|? at two different points of configuration space deter- 
mines the relative probability of the corresponding values of the coordinates. 

Since all quantities calculated by means of the wave function, and having a 
direct physical meaning, are of the form (2.1), in which ¥ appears multiplied 
by ¥*, it is clear that the normalized wave function is determined only to 
within a constant phase factor of the form eè» (where x is any real number). 
This indeterminacy is in principle irremovable; it is, however, unimportant, 
since it has no effect upon any physical results 

The positive content of quantum mechanics is founded on a series of 
propositions concerning the properties of the wave function. These are as 
follows. 

Suppose that, in a state with wave function V,(g), some measurement leads 
with certainty to a definite result (result 1), while in a state with ¥,(q) it 
leads to result 2. ‘Then it is assumed that every linear combination of Y, 
and Y, 1.e. every function of the form c,‘VY,+¢,'¥, (where c and c, are con- 
stants), gives a state in which that measurement leads to either result 1 or 
result 2. Moreover, we can assert that, if we know the time dependence of 
the states, which for the one case is given by the function Y’,(g, t), and for the 
other by ‘,(g, £), then any linear combination also gives a possible dependence 
of a state on time. These propositions constitute what is called the principle 
of superposition of states, the chief positive principle of quantum mechanics. 
In particular, it follows from this principle that all equations satisfied by 
wave functions must be linear in Y. 

Let us consider a system composed of two parts, and suppose that the state 
of this system is given in such a way that each of its parts is completely 
described.t Then we can say that the probabilities of the coordinates q; of 
the first part are independent of the probabilities of the coordinates qs of the 
second part, and therefore the probability distribution for the whole system 
should be equal to the product of the probabilities of its parts. This means 
that the wave function ‘’.(9,, g2) of the system can be represented in the form 
of a product of the wave functions ‘\(g,) and ‘¥,(qq) of its parts: 


Fie(gi» Ye) = ‘F (91) Folge). (2.3) 


If the two parts do not interact, then this relation between the wave function 
of the system and those of its parts will be maintained at future instants also, 


+ This, of course, means that the state of the whole system is completely described also. 
However, we emphasize that the converse statement is by no means true: a complete descrip- 
tion of the state of the whole system does not in general completely determine the states of 
its individual parts (see also $14). 
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i.e. we can write 


Filgi» Yor t) = Fill t) Falga t). (2.4) 


§3. Operators 

Let us consider some physical quantity f which characterizes the state 
of a quantum system. Strictly, we should speak in the following discussion 
not of one quantity, but of a complete set of them at the same time. 
However, the discussion is not essentially changed by this, and for brevity 
and simplicity we shall work below in terms of only one physical quantity. 

The values which a given physical quantity can take are called in quantum 
mechanics its ezgenvalues, and the set of these is referred to as the spectrum 
of eigenvalues of the given quantity. In classical mechanics, generally speak- 
ing, quantities run through a continuous series of values. In quantum mech- 
anics also there are physical quantities (for instance, the coordinates) whose 
eigenvalues occupy a continuous range; in such cases we speak of a continuous 
spectrum of eigenvalues. As well as such quantities, however, there exist in 
quantum mechanics others whose eigenvalues form some discrete set; in 
such cases we speak of a discrete spectrum. 

We shall suppose for simplicity that the quantity f considered here has a 
discrete spectrum; the case of a continuous spectrum will be discussed in §5. 
The eigenvalues of the quantity f are denoted by fa, where the suffix n takes 
the values 0, 1, 2,3, .... We also denote the wave function of the system, in 
the state where the quantity f has the value fa, by Yp. The wave functions 
YF, are called the eigenfunctions of the given physical quantity f. Each of these 
functions is supposed normalized, so that 


J Dal? dg = 1. (3.1) 


If the system is in some arbitrary state with wave function Y, a measure- 
ment of the quantity f carried out on it will give as a result one of the eigen- 
values f,. In accordance with the principle of superposition, we can assert 
that the wave function ¥ must be a linear combination of those eigenfunc- 
tions ¥„ which correspond to the values f,, that can be obtained, with prob- 
ability different from zero, when a measurement is made on the system and 
it is in the state considered. Hence, in the general case of an arbitrary state, 
the function can be represented in the form of a series 


Y= Da,¥,, (3.2) 


where the summation extends over all , and the a, are some constant coeffi- 
cients. 

Thus we reach the conclusion that any wave function can be, as we say, 
expanded in terms of the eigenfunctions of any physical quantity. A set of 
functions in terms of which such an expansion can be made is called a complete 
(or closed) set. 

The expansion (3.2) makes it possible to determine the probability of find- 
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ing (i.e. the probability of getting the corresponding result on measurement), 
in a system in a state with wave function ’, any given value f, of the quantity 
f. For, according to what was said in the previous section, these probabili- 
ties must be determined by some expressions bilinear in ¥ and Y*, and 
therefore must be bilinear in a„ and a,,*. Furthermore, these expressions 
must, of course, be positive. Finally, the probability of the value f, must 
become unity if the system is in a state with wave function ‘Y = ’,, and 
must become zero if there is no term containing ¥, in the expansion (3.2) 
of the wave function ¥. The only essentially positive quantity satisfying 
these conditions is the square of the modulus of the coefficient an. Thus we 
reach the result that the squared modulus |an|? of each coefficient in the 
expansion (3.2) determines the probability of the corresponding value fn of 
the quantity f in the state with wave function Y. The sum of the probabi- 
lities of all possible values fn must be equal to unity; in other words, the 
relation 


2 — 
zi = 1 (3.3) 


must hold. 

If the function ¥ were not normalized, then the relation (3.3) would not 
hold either. The sum È |a|? would then be given by some expression 
bilinear in ¥ and ¥*, and becoming unity when Y was normalized. Only 
the integral f FY* dg is such an expression. Thus the equation 


Nv x ak 
= Un Gn = | YY* dg (3.4) 


must hold. 
On the other hand, multiplying by the expansion ¥* = Za,*¥,,* of 
the function ¥* (the complex conjugate of ¥), and integrating, we obtain 


| WY" dg = Da," i rE dg. 
Comparing this with (3.4), we have 


seo al) P 
2 R Pe a lnr | Ye Y do, 


from which we derive the following formula determining the coefficients a, 
in the expansion of the function ¥ in terms of the eigenfunctions ‘F,: 


a= Í YF * dg. (3.5) 
If we substitute here from (3.2), we obtain 


On = È am | Fla” dg, 


from which it is evident that the eigenfunctions must satisfy the conditions 


[Ear dg = Sni (3.6) 
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where 6,,, = 1 form = mand pm = Oforn # m. The fact that the integrals 
of the products ¥,4,* with m # n vanish is called the orthogonality of the 
functions ¥,. Thus the set of eigenfunctions ¥,„ forms a complete set of 
normalized and orthogonal (or, for brevity, orthonormal) functions. 

We shall now introduce the concept of the mean value f of the quantity f 
in the given state. In accordance with the usual definition of mean values, 
we define f as the sum of all the eigenvalues f„ of the given quantity, each 
multiplied by the corresponding probability Ja,,|?. Thus 


f= E frlen!?. (3.7) 


We shall write f in the form of an expression which does not contain the 
coefficients a, in the expansion of the function Y, but this function itself. 
Since the products a,a,,* appear in (3.7), it is clear that the required expres- 
sion must be bilinear in ¥ and Y*. We introduce a mathematical opera- 
tor, which we denotet by f and define as follows. Let (fF) denote the result 
of the operator f acting on the function ¥. We define f in such a way that 
the integral of the product of (fF) and the complex conjugate function Y* 
is equal to the mean value f: 


J= | YGF) dg. (3.8) 


It is easily seen that, in the general case, the operator f is a linear 
integral operator. For, using the expression (3.5) for a, we can rewrite the 
definition (3.7) of the mean value in the form 


e a 2,527 yaa. 


Comparing this with (3.8), we see that the result of the operator f acting on 
the function ¥ has the form 


(FE) = E anf n (3.9) 


If we substitute here the expression (3.5) for a,, we find that f is an integral 
operator of the form 


SE) = | Kia, YEG) dg’. (3.10) 
where the function K(q, q’) (called the kernel of the operator) is 
K(q, 7) = E faa" Fal). (3.11) 


Thus, for every physical quantity in quantum mechanics, there is a definite 
corresponding linear operator. 
It is seen from (3.9) that, if the function ¥ is one of the eigenfunctions Fp 


+ By convention, we shall always denote operators bv letters with circumflexes. 
t An operator is said to be linear 1f ıt has the properties 


fev, 44.) =f Y+ YF; and fiat) = af ¥, 


where ¥, and Y, are arbitrary functions and a is an arbitrary constant. 
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(so that all the a„ except one are zero), then, when the operator f acts on it, 
this function is simply multiplied by the corresponding eigenvalue fyn: 


JEn = Safn (3.12) 


(In what follows we shall always omit the parentheses in the expression 
(PF), where this cannot cause any misunderstanding; the operator is taken 
to act on the expression which follows it.) Thus we can say that the eigen- 
functions of the given physical quantity f are the solutions of the equation 


fast, 


where f is a constant, and the eigenvalues are the values of this constant for 
which the above equation has solutions satisfying the required conditions. 
As we shall see below, the form of the operators for various physical 
quantities can be determined from direct physical considerations, and then 
the above property of the operators enables us to find the eigenfunctions 
and eigenvalues by solving the equations f¥ =f. 

Both the eigenvalues of a real physical quantity and its mean value in 
every state are real. This imposes a restriction on the corresponding 
operators. Equating the expression (3.8) to its complex conjugate, we 
obtain the relation 


[ wef) dg = | FUE) de, (3.13) 


where f* denotes the operator which is the complex conjugate of f.t This 
relation does not hold in general for an arbitrary linear operator, so that it is 
a restriction on the form of the operator f. For an arbitrary operator f we 
can find what is called the transposed operator a defined in such a way that 


foryd = | ¥( FO) dg, (3.14) 


where ¥ and © are two different functions. If we take, as the function ®, 
the function ¥* which is the complex conjugate of ‘’, then a comparison with 
(3.13) shows that we must have 


f=f*. (3.15) 


Operators satisfying this condition are said to be Hermitian.{ ‘Thus the 
operators corresponding, in the mathematical formalism of quantum 
mechanics, to real physical quantities must be Hermitian. 

We can formally consider complex physical quantities also, i.e. those 
whose eigenvalues are complex. Let f be such a quantity. Then we can 
introduce its complex conjugate quantity f*, whose eigenvalues are the com- 
plex conjugates of those of f. We denote by f+ the operator corresponding 
to the quantity f*. It is called the Hermitian conjugate of the operator f and, 


+ By definition, if for the operator f we have fy = ¢, then the complex conjugate operator 
f* is that for which we have f*y* = $* 

t For a linear integral operator of the form (3.10), the Hermitian condition means that 
the kernel of the operator must be such that K(q, q) =K*(q’, q). 
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in general, will be different from the definition of the operator f*: the mean 
value of the quantity f* in a state Y is 


7? SP 


We also have 
(fy = (| ere a 
= [ Wh r* dg 


< Í yaf dq. 


Equating these two expressions gives 


fr= fr, (3.16) 


from which it is clear that f+ is in general not the same as /*. 
The condition (3,15) can now be written 


f=fe, (3.17) 


i.e. the operator of a real physical quantity ıs che same as its Hermitian 
conjugate (Hermitian operators are also callec self-conjugate). 

We shall show how the orthogonality of the eigenfunctions of an Hermitian 
operator corresponding to different eigenvalues can be directly proved. Let 
fn and fm be two different eigenvalues of the real quantity f, and Yn, Ym the 
corresponding eigenfunctions: 


f? =) a oT = fm on 


Multiplying both sides of the first of these equations by ¥„*, and both 
sides of the complex conjugate of the second by ¥,, and subtracting corre- 
sponding terms, we find 


k tte pa Ef Oe mt = (fet my? aon 


We integrate both sides of this equation over g. Since f*= f by (3.14) the 
integral on the left-hand side of the equation is zero, so that we have 


(fr—fm) | EnEn” dq = 0, 


whence, since fp, # fm, we obtain the required orthogonality property of the 
functions ¥, and Ym- 

We have spoken here of only one physical quantity f, whereas, as we said 
at the beginning of this section, we should have spoken of a complete set 
of simultaneously measurable physical quantities. We should then have 
found that to each of these quantities f, g, ... there corresponds its operator 
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f, ê, .... The eigenfunctions Y’, then correspond to states in which all the 
quantities concerned have definite values, i.e. they correspond to definite 
sets of eigenvalues fn, £n, --- , and are simultaneous solutions of the system 
of equations 


fY =fF, EY =z"... 


§4. Addition and multiplication of operators 


If fand g are th operators corresponding to two physical quantities f and 
g, the sum f+g has a corresponding operator f+. However, the signi- 
ficance of adding different physical quantities in quantum mechanics 
depends considerably on whether the quantities are or are not simul- 
taneously measurable. If f and g are simultaneously measurable, the 
operators f and g have common eigenfunctions, which are also eigen- 
functions of the operator f+8, and the eigenvalues of the latter operator are 
equal to the sums f, + gn. But if f and g cannot simultaneously take definite 
values, their sum f+g has a more restricted significance. We can assert only 
that the mean value of this quantity in any state is equal to the sum of the 
mean values of the separate quantities: 


ftg = fre. (4.1) 


The eigenvalues and eigenfunctions of the operator f+ will not, in general, 
now bear any relation to those of the quantities f and g. It is evident that, 
if the operators f and g are Hermitian, the operator f+ will be so too, so 
that its eigenvalues are real and are equal to those of the new quantity f+g 
thus defined. 

The following theorem should be noted. Let fọ and gy be the smallest 
eigenvalues of the quantities f and g, and (f+g), that of the quantity f+g. 
Then 


S+2)0 = fot£o- (4.2) 


The equality holds if f and g can be measured simultaneously. The proof 
follows from the obvious fact that the mean value of a quantity is always 
greater than or equal to its least eigenvalue. In a state in which the quantity 
f+g has the value (f+g)) we have f+g = (f+g)>, and since, on the other 
hand, f+g = /4+é = fot&o, we arrive at the inequality (4.2). 

Next, let f and g once more be quantities that can be measured simultane- 
ously. Besides their sum, we can also introduce the concept of their product 
as being a quantity whose eigenvalues are equal to the products of those of the 
quantities f and g. It is easy to see that, to this quantity, there corresponds 
an operator whose effect consists of the successive action on the function 
of first one and then the other operator. Such an operator is represented 
mathematically by the product of the operators f and g. For, if ¥,„ are the 
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eigenfunctions common to the operators f and g, we have 
ee = fpr) = AT oi E E, z Eala la 


(the symbol fê denotes an operator whose effect on a function ¥ consists of 
the successive action first of the operator @ on the function Y and then of the 
operator f on the function g). We could equally well take the operator £ 3 
instead of ff, the former differing from the latter in the order of its factors. 
It is obvious that the result of the action of either of these operators on the 
functions ‘, will be the same. Since, however, every wave function ¥ can 
be represented as a linear combination of the functions ¥,, it follows that 
the result of the action of the operators fê and gf on an arbitrary function will 
also be the same. This fact can be written in the form of the symbolic 


equation fê = ¿f or 
fé—éf = 0. (4.3) 


Two such operators f and £ are said to be commutative, or to commute 
with each other. Thus we arrive at the important result: if two quantities 
f and g can simultaneously take definite values, then their operators com- 
mute with each other. 

The converse theorem can also be proved (§11): if the operators f and g 
commute, then all their eigenfunctions can be taken common to both; 
physically, this means that the corresponding physical quantities can be 
measured simultaneously. Thus the commutability of the operators is a 
necessary and sufficient condition for the physical quantities to be simultane- 
ously measurable. 

A particular case of the product of operators is an operator raised to some 
power. From the above discussion we can deduce that the eigenvalues of an 
operator f? (where p is an integer) are equal to the pth powers of the eigen- 
values of the operator f. Any function ¢(/) of an operator can be defined 
as an operator whose eigenvalues are equal to the same function ¢(f) of 
the eigenvalues of the operator f. If the function ¢( f) can be expanded as a 
Taylor series, this expresses the effect of the operator ¢( f ) in terms of those 
of various powers f ?. 

In particular, the operator f ~1 is called the inverse of the operator Akie 
evident that the successive action of the operators f and f —! on any function 
leaves the latter unchanged, i.e. ff = ff = 1. 

If the quantities f and g cannot be measured simultaneously, the concept 
of their product does not have the same direct meaning. This appears in 
the fact that the operator ff is not Hermitian in this case, and hence cannot 
correspond to any real physical quantity. For, by the definition of the 
transpose of an operator we can write 


[ABO dq = [ ERED) dg = | (EOF Y) de. 


Here the operator f acts only on the function Y¥, and the operator £ on ®, so 
that the integrand is a simple product of two functions £@ and f¥. Again 


§5 The continuous spectrum 15 


using the definition of the transpose of an operator, we can write 


[ ¥/0 dq = f (FEED) dg =f Off ¥ ag. 


Thus we obtain an integral in which the functions ¥ and @ have changed 
places as compared with the original one. In other words, the operator Bf 
is the transpose of fê, and we can write 


fe =E (4.4) 


i.e. the transpose of the product f$ is the product of the transposes of the 
factors written in the opposite order. Taking the complex conjugate of both 
sides of equation (4.4), we have 


(fay = Sf. (4.5) 

If each of the operators f and ¢ is Hermitian, then ( fg)" = éf. It follows 

from this that the operator f$ is Hermitian if and only if the factors f and ć 
commute. 


We note that, from the products ff and £f of two non-commuting Hermitian 
operators, we can form an Hermitian operator, the symmetrized product 


+EP). (4.6) 


It is easy to see that the difference /é—£f is an anti-Hermitian operator 
(i.e. one for which the transpose is equal to the complex conjugate taken with 
the opposite sign). It can be made Hermitian by multiplying by z; thus 


ER (4.7) 
is again an Hermitian operator. 
In what follows we shall sometimes use for brevity the notation 


E = f éf, (4.8) 
called the commutator of these operators. It is easily seen that 
E, hy = Uf ME+Ké, h). (4.9) 


We notice that, if {f h} = 0 and {é, Á} = 0, it does not in general follow 
that f and é commute. 


§5. The continuous spectrum 


All the relations given in §§3 and 4, describing the properties of the eigen- 
functions of a discrete spectrum, can be generalized without difficulty to the 
case of a continuous spectrum of eigenvalues. 

Let f be a physical quantity having a continuous spectrum. We shall 
denote its eigenvalues by the same letter f simply, and the corresponding 
eigenfunctions by Fp. Just as an arbitrary wave function ¥ can be expanded 
in a series (3.2) of eigenfunctions of a quantity having a discrete spectrum, 
it can also be expanded (this time as an integral) in terms of the complete 
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set of eigenfunctions of a quantity with a continuous spectrum. This 
expansion has the form 


F) = | a¥ (9) of, (5.1) 


where the integration is extended over the whole range of values that can be 
taken by the quantity f. 

The subject of the normalization of the eigenfunctions of a continuous 
spectrum is more complex than in the case of a discrete spectrum. The 
requirement that the integral of the squared modulus of the function should 
be equal to unity cannot here be satisfied, as we shall see below. Instead, 
we try to normalize the functions i’; in such a way that |@;|* df is the prob- 
ability that the physical quantity concerned, in the state described by the 
wave function ¥’, has a value between f and f+df. Since the sum of the 
probabilities of all possible values of f must be equal to unity, we have 


fla’ af =1 (5.2) 


(similarly to the relation (3.3) for a discrete spectrum). 
Proceeding in exactly the same way as in the derivation of formula (3.5), 
and using the same arguments, we can write, firstly, 


fewe ag = flat 
and, secondly, 


| Wye dg = Í i a AY dfdg. 


By comparing these two expressions we find the formula which determines 
the expansion coefficients, 


a, = | EE") dg, (5.3) 


in exact analogy to (3.5). 
To derive the normalization condition, we now substitute (5.1) in (5.3), 
and obtain 


ay = f ap(S[¥ r ¥,* dq) df’. 


This relation must hold for arbitrary as, and therefore must be satisfied 
identically. For this to be so, it is necessary that, first of all, the coefficient 
of ay. in the integrand (i.e. the integral { ¥/¥;* dg) should be zero for 
all f’ #f. For f' = f, this coefficient must become infinite (otherwise the 
integral over f’ would vanish). Thus the integral f W,‘¥',* dg is a function 
of the difference f'—f, which becomes zero for values of the argument 
different from zero and is infinite when the argument is zero. We denote 
this function by 6(f’—/): 


f FF, dg = &(f’—f). (5.4) 
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The manner in which the function 8(f’ —f) becomes infinite for f'—f = 0 
is determined by the fact that we must have 


f 8(f’—f) apdf! = ar. 
It is clear that, for this to be so, we must have 
[sg af’ = 1. 


The function thus defined is called a delta function, and was first used in 
theoretical physics by P. A. M. Dirac. We shall write out once more the 


formulae which define it. They are 


d(x) = O for x #0, 6(0) = œ, (5.5) 
while 


fs) de= 1: (5.6) 


We can take as limits of integration any numbers such that x = 0 lies between 
them. If f(x) is some function continuous at x = 0, then 


| A dx = f(0). (5.7) 
This formula can be written in the more general form 


[ 8@—ayf(x) dx = fla), (5.8) 


where the range of integration includes the point x = a, and f(x) is continuous 
at x = a. It is also evident that 


d(—x) = d(x), (5.9) 
i.e. the delta function 1s even. Finally, pa 
J (ax) dx = | Cee 
lo] lel 
we can deduce that 
B(ax) = (1/]æl) (x) (5.10) 


where « is any constant. 

The formula (5.4) gives the normalization rule for the eigenfunctions of a 
continuous spectrum; it replaces the condition (3.6) for a discrete spectrum. 
We see that the functions Y; and Vy with f # f’ are, as before, orthogonal. 
However, the integrals of the squared moduli |‘Y;|? of the functions diverge 
for a continuous spectrum. 

The functions ¥;(q) satisfy still another relation similar to (5.4). To derive 
this, we substitute (5.3) in (5.1), which gives 


F(a) = | POS ENEA) df) dg, 


whence we can at once deduce that we must have 


[EPAIA of = 8(@’—9). (5.11) 
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There is, of course, an analogous relation for a discrete spectrum: 


D Farla Enla) = g'a) (5.12) 


Comparing the pair of formulae (5.1), (5.4) with the pair (5.3), (5.11), we 
see that, on the one hand, the function ¥(q) can be expanded in terms of 
the functions FY;(q) with expansion coefficients ay and, on the other hand, 
formula (5.3) represents an entirely analogous expansion of the function 
ay = a(f) in terms of the functions Y;*(q), while the ¥(q) play the part of 
expansion coefficients. The function a(f), like ¥(q), completely determines 
the state of the system; it is sometimes called a wave function in the f repre- 
sentation (while the function ¥(q) is called a wave function in the q representa- 
tion). Just as |Y(g)|* determines the probability for the system to have co- 
ordinates lying in a given interval dg, so |a(f)|? determines the probability for 
the values of the quantity f to lie in a given interval df. On the one hand, 
the functions Y ;(q) are the eigenfunctions of the quantity f in the g representa- 
tion; on the other hand, their complex conjugates are the eigenfunctions of 
the coordinate çg in the f representation. 

Let ¢(f) be some function of the quantity f, such that ¢ and f are related in 
a one-to-one manner. Each of the functions ¥,(q) can then be regarded as 
an eigenfunction of the quantity ¢. Here, however, the normalization of 
these functions must be changed: the eigenfunctions ‘",(g) of the quantity ¢ 
must be normalized by the condition 


[Eart ar" dg = S-A, 


whereas the functions Yp. are normalized by the condition (5.4). The argu- 
ment of the delta function becomes zero only for f’ = f. As f’ approaches f, 


we have 4( f')—A A) = [dd( Ndr] . (f’—f). By (5.10) we can therefore write 


1 
Af A) = iaar cai): (5.13) 


Comparing this with (5.4), we see that the functions Y4 and Yp are related 
by 


Yau) (5.14) 


a. 
Vide dsl 
There are also physical quantities which in one range of values have a 

discrete spectrum, and in another a continuous spectrum. For the eigen- 

functions of such a quantity all the relations derived in this and the previous 
sections are, of course, true. It need only be noted that the complete set 
of functions is formed by combining the eigenfunctions of both spectra. 


+ in general, if ¢(x) is some one-valued function (the inverse function need not be one- 
valued), we have 


1 
le] = > wee (5.13a) 


where a; are the roots of the equation ¢{x) = Q. 
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Hence the expansion of an arbitrary wave function in terms of the eigenfunc- 
tions of such a quantity has the form 


F(a) = Lan al) + | a£) df (5.15) 


where the sum is taken over the discrete spectrum and the integral over the 
whole continuous spectrum. 

The coordinate çq itself is an example of a quantity having a continuous 
spectrum. It is easy to see that the operator corresponding to it is simply 
multiplication by g. For, since the probability of the various values of the 
coordinate is determined by the square |Y(q)|2, the mean value of the 
coordinate is 


g = f Y|? dg = | P*g¥ dg. 
Comparison of this with the definition (3.8) of an operator shows thatt 


= 4. (5.16) 


The eigenfunctions of this operator must be determined, according to the 
usual rule, by the equation qF, = go F4, where gg temporarily denotes the 
actual values of the coordinate as distinct from the variable g. Since this 
equation can be satisfied either by ¥,, = 0 or by q = qo, it is clear that the 
eigenfunctions which satisfy the normalization condition aref{ 


Fa, = 8(9—%)- (5.17) 


§6. The passage to the limiting case of classical mechanics 


Quantum mechanics contains classical mechanics in the form of a certain 
limiting case. The question arises as to how this passage to the limit 1s 
made. 

In quantum mechanics an electron is described by a wave function which 
determines the various values of its coordinates; of this function we so far 
know only that it is the solution of a certain linear partial differential equation. 
In classical mechanics, on the other hand, an electron is regarded as a material 
particle, moving in a path which is completely determined by the equations 
of motion. There is an interrelation, somewhat similar to that between 
quantum and classical mechanics, in electrodynamics between wave optics 


+ In future we shall always, for simplicity, write operators which amount to multiplication 
by some quantity in the form of that quantity itself. 

t The expansion coefficients for an arbitrary function ¥ in terms of these eigenfunctions 
are 


ag, = | ¥(q)8(q—g0) dg = Flg). 
The probability that the value of the coordinate lies in a given interval dga is 
lag |* dgo = IC go)i? dyos 
as it should be. 
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and geometrical optics. In wave optics, the electromagnetic waves are 
described by the electric and magnetic field vectors, which satisfy a definite 
system of linear differential equations, namely Maxwell’s equations. In 
geometrical optics, however, the propagation of light along definite paths, or 
rays, is considered. Such an analogy enables us to see that the passage from 
quantum mechanics to the limit of classical mechanics occurs similarly to the 
passage from wave optics to geometrical optics. 

Let us recall how this latter transition is made mathematically (see Fields, 
§53). Let u be any of the field components in the electromagnetic wave. It 
can be written in the form u = ae‘? (with a and ¢ real), where a is called the 
amplitude and ¢ the phase of the wave (called in geometrical optics the 
eikonal). The limiting case of geometrical optics corresponds to small wave- 
lengths; this is expressed mathematically by saying that ¢ varies by a large 
amount over short distances; this means, in particular, that it can be 
supposed large in absolute value. 

Similarly, we start from the hypothesis that, to the limiting case of classical 
mechanics, there correspond in quantum mechanics wave functions of the 
form ‘ = ae'?, where a is a slowly varying function and ¢ takes large values. 
As is well known, the path of a particle can be determined in mechanics by 
means of the variational principle, according to which what is called the 
action S of a mechanical system must take its least possible value (the principle 
of least action). In geometrical optics the path of the rays is determined by 
what is called Fermat's principle, according to which the optical path length 
of the ray, i.e. the difference between its phases at the beginning and end of 
the path, must take its least (or greatest) possible value. 

On the basis of this analogy, we can assert that the phase ¢ of the wave 
function, in the limiting (classical) case, must be proportional to the mech- 
anical action S of the physical system considered, i.e. we must have 
S = constant ¢. The constant of proportionality is called Planck’s con- 
stantt and is denoted by A. It has the dimensions of action (since ¢ is 
dimensionless) and has the value 


h = 1-054 x 10-*? erg sec. 


Thus, the wave function of an ‘‘almost classical” (or, as we say, quasi- 
classical) physical system has the form 


Y = ag [nh _ (6.1) 


Planck’s constant A plays a fundamental part in all quantum phenomena. 
Its relative value (compared with other quantities of the same dimensions) 
determines the “extent of quantization” of a given physical system. The 
transition from quantum mechanics to classical mechanics, corresponding to 
large phase, can be formally described as a passage to the limit A —> 0 (just 


+ It was introduced into physics by M. Planck in 1900. The constant A, which we use 
everywhere in this book, is, strictly speaking, Planck’s constant divided by 27; this is Dirac’s 
notation. 
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as the transition from wave optics to geometrical optics corresponds to a 
passage to the limit of zero wavelength, A —> 0). 

We have ascertained the limiting form of the wave function, but the 
question still remains how it is related to classical motion in a path. In 
general, the motion described by the wave function does not tend to motion 
in a definite path. Its connection with classical motion is that, if at some 
initial instant the wave function, and with it the probability distribution of 
the coordinates, is given, then at subsequent instants this distribution will 
change according to the laws of classical mechanics (for a more detailed dis- 
cussion of this, see the end of §17). 

In order to obtain motion in a definite path, we must start from a wave 
function of a particular form, which is perceptibly different from zero only 
in a very small region of space (what is called a wave packet); the dimensions 
of this region must tend to zero with A. Then we can say that, in the quasi- 
classical case, the wave packet will move in space along a classical path of a 
particle. 

Finally, quantum-mechanical operators must reduce, in the limit, simply 
to roultiplication by the corresponding physical quantity. 


§7. The wave function and measurements 


Let us again return to the process of measurement, whose properties have 
been qualitatively discussed in §1; we shall show how these properties are 
related to the mathematical formalism of quantum mechanics. 

We consider a system consisting of two parts: a classical apparatus and 
an electron (regarded as a quantum object). The process of measurement 
consists in these two parts’ coming into interaction with each other, as a 
result of which the apparatus passes from its initial state into some other; 
from this change of state we draw conclusions concerning the state of the 
electron. The states of the apparatus are distinguished by the values of some 
physical quantity (or quantities) characterizing it-—the “readings of the ap- 
paratus”. We conventionally denote this quantity by g, and its eigenvalues 
by g,; these take in general, in accordance with the classical nature of the 
apparatus, a continuous range of values, but we shall—merely in order to 
simplify the subsequent formulae—suppose the spectrum discrete. The 
states of the apparatus are described by means of quasi-classical wave func- 
tions, which we shall denote by ©,(&), where the suffix n corresponds to the 
“reading” gn of the apparatus, and € denotes the set of its coordinates. The 
classical nature of the apparatus appears in the fact that, at any given instant, 
we can say with certainty that it is in one of the known states ®, with some 
definite value of the quantity g; for a quantum system such an assertion 
would, of course, be unjustified. 

Let ®,(€) be the wave function of the initial state of the apparatus (before 
the measurement), and ‘f’(g) some arbitrary normalized initial wave function 
of the electron (q denoting its coordinates). These functions describe the 
state of the apparatus and of the electron independently, and therefore the 
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initial wave function of the whole system is the product 


E(q) PolE). (7.1) 


Next, the apparatus and the electron interact with each other. Applying the 
equations of quantum mechanics, we can in principle follow the change of 
the wave function of the system with time. After the measuring process it 
may not, of course, be a product of functions of € and q. Expanding the 
wave function in terms of the eigenfunctions ®,, of the apparatus (which 
form a complete set of functions), we obtain a sum of the form 


E An(g)Px(6) (7.2) 


where the A,(¢) are some functions of q. 

The classical nature of the apparatus, and the double role of classical 
mechanics as both the limiting case and the foundation of quantum mechanics, 
now make their appearance. As has been said above, the classical nature of 
the apparatus means that, at any instant, the quantity g (the “reading of the 
apparatus”) has some definite value. This enables us to say that the state 
of the system apparatus -+ electron after the measurement will in actual fact 
be described, not by the entire sum (7.2), but by only the one term which 
corresponds to the “reading” g,, of the apparatus, 


A,(q)Px({€). (7.3) 


It follows from this that An(g) is proportional to the wave function of the 
electron after the measurement. It is not the wave function itself, as is seen 
from the fact that the function An(q) is not normalized. It contains both 
information concerning the properties of the resulting state of the electron 
and the probability (determined by the initial state of the system) of the 
occurrence of the mth “reading” of the apparatus. 

Since the equations of quantum mechanics are linear, the relation between 
An(q) and the initial wave function of the electron ¥(q) is in general given by 
some linear integral operator: 


Aig) = Í Kala, PFG’) da’, (7.4) 


with a kernel Ka(q, 9’) which characterizes the measurement process con- 
cerned. 

We shall suppose that the measurement concerned is such that it gives a 
complete description of the state of the electron. In other words (see §1), 
in the resulting state the probabilities of all the quantities must be indepen- 
dent of the previous state of the electron (before the measurement). Mathe- 
matically, this means that the form of the functions A,(g) must be determined 
by the measuring process itself, and does not depend on the initial wave 
function ¥(q) of the electron. Thus the A, must have the form 


A,(q) = anPr(Q)s (7.5 ) 
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where the ¢» are definite functions, which we suppose normalized, and only 
the constants an depend on F(q). In the integral relation (7.4) this corresponds 
to a kernel K;,,(g, g') which is a product of a function of g and a function of gq’: 


Kanla, 7’) = $n Enl’); (7.6) 


then the linear relation between the constants an and the function ¥(q) is 
a, = | FE") dg, (7.7) 


where the ‘Y’,(g) are certain functions depending on the process of measure- 
ment. 

The functions ġn(q) are the normalized wave functions of the electron after 
measurement. Thus we see how the mathematical formalism of the theory 
reflects the possibility of finding by measurement a state of the electron 
described by a definite wave function. 

If the measurement 1s made on an electron with a given wave function 
Y(q), the constants a, have a simple physical meaning: in accordance with 
the usual rules, |a,|* is the probability that the measurement will give the 
nth result. The sum of the probabilities of all results is equal to unity: 


£ Jan? = 1. (7.8) 


In order that equations (7.7) and (7.8) should hold for an arbitrary nor- 
malized function ¥(q), it is necessary (cf. §3) that an arbitrary function ¥'(q¢) 
can be expanded in terms of the functions Y’,(g). This means that the 
functions ¥',(q) form a complete set of normalized and orthogonal functions. 

If the initial wave function of the electron coincides with one of the func- 
tions ¥,(q), then the corresponding constant a, is evidently equal to unity, 
while all the others are zero. In other words, a measurement made on an 
electron in the state W,(q) gives with certainty the nth result. 

All these properties of the functions ¥,(q) show that they are the eigen- 
functions of some physical quantity (denoted by f ) which characterizes the 
electron, and the measurement concerned can be spoken of as a measurement 
of this quantity. 

It is very important to notice that the functions ¥,„(q) do not, in general, 
coincide with the functions ¢,,(q); the latter are in general not even mutually 
orthogonal, and do not form a set of eigenfunctions of any operator. This 
expresses the fact that the results of measurements in quantum mechanics 
cannot be reproduced. If the electron was in a state ‘Y,,(q), then a measure- 
ment of the quantity f carried out on it leads with certainty to the value fa- 
After the measurement, however, the electron is in a state ¢,(q) different 
from its initial one, and in this state the quantity f does not in general take 
any definite value. Hence, on carrying out a second measurement on the 
electron immediately after the frst, we should obtain for f a value which did 
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not agree with that obtained from the first measurement.f To predict (in the 
sense of calculating probabilities) the result of the second measurement from 
the known result of the first, we must take from the first measurement the 
wave function ¢,(q) of the state in which it resulted, and from the second 
measurement the wave function Y,,(¢) of the state whose probability is re- 
quired. This means that from the equations of quantum mechanics we deter- 
mine the wave function ¢,(q, t) which, at the instant when the first measure- 
ment is made, is equal to¢,(g); the probability of the mth result of the second 
measurement, made at time £, is then given by the squared modulus of the 
integral f ¢,(9, t)Fm*(g) dg. 

We see that the measuring process in quantum mechanics has a “‘two- 
faced” character: it plays different parts with respect to the past and future 
of the electron. With respect to the past, it “verifies” the probabilities of the 
various possible results predicted from the state brought about by the previ- 
ous measurement. With respect to the future, it brings about a new state 
(see also §44). Thus the very nature of the process of measurement involves 
a far-reaching principle of irreversibility. 

This irreversibility is of fundamental significance. We shall see later (at 
the end of §18) that the basic equations of quantum mechanics are in them- 
selves symmetrical with respect to a change in the sign of the time; here 
quantum mechanics does not differ from classical mechanics. The irrever- 
sibility of the process of measurement, however, causes the two directions 
of time to be physically non-equivalent, i.e. creates a difference between the 
future and the past. 





+ There is, however, an important exception to the statement that results of measurements 
cannot be reproduced: the one quantity the result of whose measurement can be exactly 
reproduced is the coordinate. Two measurements of the coordinates of an electron, made at 
a sufficiently small interval of time, must give neighbouring values; if this were not so, it 
would mean that the electron had an infinite velocity. Mathematically, this is related to the 
fact that the coordinate commutes with the operator of the interaction energy between the 
electron and the apparatus, since this energy is (in non-relativistic theory) a function of the 
coordinates only. k 


CHAPTER II 


ENERGY AND MOMENTUM 


§8. The Hamiltonian operator 


THE wave function ‘ completely determines the state of a physical system 
in quantum mechanics. This means that, if this function is given at some 
instant, not only are all the properties of the system at that instant described, 
but its behaviour at all subsequent instants is determined (only, of course, to 
the degree of completeness which is generally admissible in quantum mech- 
anics). The mathematical expression of this fact is that the value of the deri- 
vative 3F/ðt of the wave function with respect to time at any given instant 
must be determined by the value of the function itself at that instant, and, 
by the principle of superposition, the relation between them must be linear. 
In the most general form we can write 


ih Fet = AY, (8.1) 
where Ĥ is some linear operator; the factor zi is introduced here for a reason 


that will become apparent. 
Since the integral [F®F dg is a constant independent of time, we have 





< f epa = | Z va ef y ana 
di $ T e 


Substituting here (8.1) and using in the first integral the definition of the 
transpose of an operator, we can write (omitting the common factor 1/h) 


a fad fad 


[Hers dg - | YH dg = fester dg — [AF dg 
=l W*(H*— A) dq = 0. 


Since this equation must hold for an arbitrary function Y, it follows that we 
must have identically H+ = H; the operator H is therefore Hermitian. Let 
us find the physical quantity to which it corresponds. To do this, we use 
the limiting expression (6.1) for the wave function and write 

OY i aS 

ath at 
the slowly varying amplitude a need not be differentiated. Comparing this 


equation with the definition (8.1), we see that, in the limiting case, the 
operator H reduces to simply multiplying by —¢S/ét. This means that 
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—€S/ét is the physical quantity into which the Hermitian operator H 
passes. 

The derivative —CS/Ct is just Hamilton’s function H for a mechanical 
system. Thus the operator H is what corresponds in quantum mechanics 
to Hamilton’s function; this operator 1s called the Hamiltonian operator or, 
more briefly, the Hamiltonian uf the system. If the form of the Hamiltonian 
is known, equation (8.1) determines the wave functions of the phvsical 
system concerned. This fundamental equation of quantum mechanics is 
called the wave equation 


§9. The differentiation of operators with respect to time 

The concept of the derivative of a physical quantity with respect to time 
cannot be defined in quantum mechanics in the same way as in classical mech- 
anics. For the definition of the derivative in classical mechanics involves 
the consideration of the values of the quantity at two neighbouring but 
distinct instants of time. In quantum mechanics, however, a quantity which 
at some instant has a definite value does not in general have definite values at 
subsequent instants; this was discussed in detail in §1. 

Hence the idea of the derivative with respect to time must be differently 
defined in quantum mechanics. It is natural to define the derivative f of a 
quantity f as the quantity whose mean value is equal to the derivative, with 
respect to time, of the mean value f. Thus we have the definition 


fof (9.1) 
Starting from this definition, it 1s easy to obtain an expression for the 


quantum-mechanical operator f corresponding to the quantity f: 


f=f= £ fiera -= fera $ [Era + [rya 
“C dt a ae at ? ae 
Here 0//dt is the operator obtained by differentiating the operator f with 


respect to time; f may depend on the time as a parameter. Substituting for 
oY /ot, o*/dt their expressions according to (8.1), we obtain 


of í k 1 $ 
f= fja SY agt: | (A ye) f ag | ¥ KAY) de. 
Since the operator Ĥ is Hermitian, we have 
Í (AF) fF) dg = f Vf dg: 
thus 
f= ll ve(La infin) dg 
a h` h l 


_ Since, on the other hand, we must have, by the definition of mean values, 
f=JW¥*f¥ dg, it is seen that the expression in parentheses in the inte- 
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grand is the required operator ft 
of i 
= —+-(Af—fA). 9.2 
f= = + (Af—/f) (9.2) 


If the operator f is independent of time, ri reduces, apart from a constant 
factor, to the commutator of the operator f and the Hamiltonian. 

A very important class of physical quantities is formed by those whose 
operators do not depend explicitly on time, and also commute with the 
Hamiltonian, so that f = 0. Such quantities are said to be conserved. For 
these f = f = 0, that is, f is constant. In other words, the mean value of 
the quantity remains constant in time. We can also assert that, if in a given 
state the quantity f has a definite value (i.e. the wave function is an eigen- 
function of the operator f ), then it will have a definite value (the same one) 
at subsequent instants also. 


§10. Stationary states 


The Hamiltonian of a closed system (and of a system in a constant external 
field) cannot contain the time explicitly. This follows from the fact that, 
for such a system, all times are equivalent. Since, on the other hand, any 
operator of course commutes with itself, we reach the conclusion that 
Hamilton's function is conserved for systems which are not in a varying 
external field. As is well known, a Hamilton’s function which is conserved 
is called the energy. The law of conservation of energy in quantum mecha- 
nics signifies that, if in a given state the energy has a definite value, this 
value remains constant in time. 


+ In classical mechanics we have for the total derivative, with respect to time, of a quantity 
f which is a function of the generalized coordinates qi and momenta pi of the system 


df of > fe -of 

de 8 (Gat apt) 
Substituting, ın accordance with Hamilton’s equations, gi = 6H/pi and pi = —@H/ege, we 
obtain 


dfidt = effer+([H./], 


of ƏH əf OH 
nyA- 
: qi OP; pi dqu 
is what is called the Potsson bracket for the quantities f and H (see Mechanics, §42). On 
comparing with the expression (9.2), we see that, as we pass to the limit of classical mechanics, 
the operator i(f7/—f H) reduces in the first approximation to zero, as it should, and in the 
second approximation (with respect to fi) to the quantity A[H, f]. This result is true also 


for any two quantities f and g; the operator i( fe —éf ) tends in the limit to the quantity 
A{ f, g], where [ f, g] is the Poisson bracket 


Og Of og of 
l= ew. igs aa 
; Qs Pi Op; 095 


This follows from the fact that we can always formally imagine a system whose Hamiltonian | 
is £. 


where 
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States in which the energy has definite values are called stationary states 
of a system. They are described by wave functions ¥, which are the eigen- 
functions of the Hamiltonian operator, i.e. which satisfy the equation 
HY, = E,,¥,,, where E, are the eigenvalues of the energy. Correspondingly, 
the wave equation (8.1) for the function ¥’,,, 


ih ov ,,/ot = AY, = EY, 
can be integrated at once with respect to time and gives 
Tn Sn IME ntng), (10.1) 


where yp is a function of the coordinates only. This determines the relation 
between the wave functions of stationary states and the time. 

We shall denote by the small letter ¢ the wave functions of stationary states 
without the time factor. ‘These functions, and also the eigenvalues of the 
energy, are determined by the equation 


Ĥy = Ey. (10.2) 


The stationary state with the smailest possible value of the energy is called 
the normal or ground state of the system. 

The expansion of an arbitrary wave function in terms of the wave func- 
tions of stationary states has the form 


F = E ange ME nty,,(q). (10.3) 


The squared moduli |a,,|? of the expansion coefficients, as usual, determine 
the probabilities of various values of the energy of the system. 

The probability distribution for the cuurdinates in a stationary state is 
determined by the squared modulus |, |? = |y,,|?; we see that it is indepen- 
dent of time. The same is true of the mean values 


f=] ¥itf¥n dg = f dathln dg 


of any physical quantity f (whose operator does not depend explicitly on the 
time). 

As has been said, the operator of any quantity that is conserved commutes 
with the Hamiltonian. This means that any physical quantity that is con- 
served can be measured simultaneously with the energy. 

Among the various stationary states, there may be some which correspond 
to the same value of the energy (the same energy level of the system), but 
differ in the values of some other physical quantities. Such energy levels, 
to which several different stationary states correspond, are said to be 
degenerate. Physically, the possibility that degenerate levels can exist is 
related to the fact that the energy does not in general form by itself a com- 
plete set of physical quantities. 

If there are two conserved physical quantities f and g whose operators do 
not commute, then the energy levels of the system are in general degenerate. 
For, let y be the wave function of a stationary state in which, besides the 
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energy, the quantity f also has a definite value. Then we can say that the 
function £¢ does not coincide (apart from a constant factor) with 4; if it did, 
this would mean that the quantity g also had a definite value, which is 
impossible, since f and g cannot be measured simultaneously. On the other 
hand, the function gy is an eigenfunction of the Hamiltonian, corresponding 
to the same value E of the energy as wb: 


Agy) =A = Egy). 


Thus we see that the energy E corresponds to more than one eigenfunction, 
i.e. the energy level is degenerate. 

It is clear that any linear combination of wave functions corresponding 
to the same degenerate energy level is also an eigenfunction for that value of 
the energy. In other words, the choice of eigenfunctions of a degenerate 
energy level is not unique. Arbitrarily selected eigenfunctions of a degener- 
ate energy level are not, in general, orthogonal. By a proper choice of linear 
combinations of them, however, we can always obtain a set of orthogonal 
(and normalized) eigenfunctions (and this can be done in infinitely many 
ways; for the number of independent coefficients in a linear transformation 
of n functions is m2, while the number of normalization and orthogonality 
conditions for n functions is $7(n-+1), i.e. less than nê). 

These statements concerning the eigenfunctions of a degenerate energy 
level relate, of course, not only to eigenfunctions of the energy, but also to 
those of any operator. Only those functions are automatically orthogonal 
which correspond to different eigenvalues of the operator concerned; 
functions which correspond to the same degenerate eigenvalue are not in 
general orthogonal. 

If the Hamiltonian of the system is the sum of two (or more) parts, 
A = Ay + Ê, one of which contains only the coordinates qı and the other 
only the coordinates g2, then the eigenfunctions of the operator A can be 
written down as products of the eigenfunctions of the operators Hı and Ae, 
and the eigenvalues of the energy are equal to the sums of the eigenvalues of 
these operators. 

The spectrum of eigenvalues of the energy may be either discrete or 
continuous. A stationary state of a discrete spectrum always corresponds to 
a finite motion of the system, i.e. one in which neither the system nor any 
part of it moves off to infinity. For, with eigenfunctions of a discrete spec- 
trum, the integral f ||? dg, taken over all space, is finite. ‘This certainly 
means that the squared modulus ||? decreases quite rapidly, becoming 
zero at infinity. In other words, the probability of infinite values of the co- 
ordinates is zero; that is, the system executes a finite motion, and is said to 
be in a bound state. 

For wave functions of a continuous spectrum, the integral f |‘t"|® dg diverges. 
Here the squared modulus |¥|? of the wave function does not directly deter- 
mine the probability of the various values of the coordinates, and must be 
regarded only as a quantity proportional to this probability. The divergence 
of the integral f ||? dq is always due to the fact that |F|? does not become 
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zero at infinity (or becomes zero insufficiently rapidly). Hence we can say 
that the integral f ||? dg, taken over the region of space outside any arbi- 
trarily large but finite closed surface, will always diverge. This means that, 
in the state considered, the system (or some part of it) is at infinity. For a 
wave function which is a superposition of the wave functions of various 
stationary states of a continuous spectrum, the integral f IF]? dg may 
converge, so that the system lies in a finite region of space. However, in the 
course of time, this region moves unrestrictedly, and eventually the system 
moves off to infinity. This can be seen as follows. Any superposition of 
wave functions of a continuous spectrum has the form 


Y =: f a pe INEU, e(a) dE. 


The squared modulus of ¥ can be written in the form of a double integral: 
p= ff a pa pe MEE eg) e*l) GEE’. 


If we average this expression over some time interval T, and then let T tend 
to infinity, the mean values of the oscillating factors e(@/M(E" -E)t, and there- 
fore the whole integral, tend to zero in the limit. Thus the mean value, 
with respect to time, of the probability of finding the system at any given 
point of configuration space tends to zero. This is possible only if the 
motion takes place throughout infinite space.t Thus the stationary states of 
a continuous spectrum correspond to an infinite motion of the system. 


§11. Matrices 

We shall suppose for convenience that the system considered has a discrete 
energy spectrum; all the relations obtained below can be generalized at once 
to the case of a continuous spectrum. Let ¥ = Za,‘¥,, be the expansion of 
an arbitrary wave function in terms of the wave functions ¥, of the stationary 
states. If we substitute this expansion in the definition (3.8) of the mean 
value of some quantity f, we obtain 


T= E2 ee (11.1) 
where f,,,,(f) denotes the integral 
Ffam(t) = [eee dg. (11.2) 


The set of quantities fnm(t) with all possible n and m is called the matrix of the 


+ Note that, for a function ¥ which is a superposition of functions of a discrete spectrum, 
we should have 


[FI = EE agate Em Eat pm = E antala)’, 
nm n 


i.e. the probability density remains finite on averaging over time. 
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quantity f, and each of the fnm(t) is called the matrix element corresponding 
to the transition from state m to state n.t 
The dependence of the matrix elements fam(t) on time is determined (if 

the operator f does not contain the time explicitly) by the dependence of the 
functions ¥, on time. Substituting for them the expressions (10.1), we find 
that | 

fant) = fame nmt, (11.3) 
where 

EEEN E E, (11.4) 


is what is called the transition frequency between the states n and m, and the 
quantities 


fam = | bn*hbm dg (11.5) 


form the matrix of the quantity f which is independent of time, and which 
is commonly used.] 

The matrix elements of the derivative f are obtained by differentiating the 
matrix elements of the quantity f with respect to time; this follows directly 
from the fact that 


f=} = EÈ an*anfum(t) (11.6) 
From (11.3) we thus have for the matrix elements of f 
fant) = onnen (11.7) 


or (cancelling the time factor e*!»nt from both sides) for the matrix elements 
independent of time 


G2 ape = Sea ae = (i/A{En— Em) fam (11.8) 


To simplify the notation in the formulae, we shall derive all our relations 
below for the matrix elements independent of time; exactly similar relations 
hold for the matrices which depend on the time. 

For the matrix elements of the complex conjugate f* of the quantity f we 
obtain, taking into account the definition of the Hermitian conjugate operator, 


(Fam = | Wf tm dg = | paf ym dg = | mf "Yn" dg 
or 


(f*) nm = (fmn)*- (11.9) 


For real physical quantities, which are the only ones we usually consider, 


+ The matrix representation of physical quantities was introduced by Heisenberg in 1925, 
before Schrédinger’s discovery of the wave equation. ‘Matrix mechanics’ was later de- 
veloped by MI. Born, W. Hetsenberg and P. Jordan. 

t Because of the indeterminacy of the phase factor in normalized wave functions (see §2), 
the matrix elements fam (and fnm(t)) also are determined only to within a factor of the form 
etm n). Here again this indeterminacy has no effect on any physical results. 
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we consequently have 
Fan = fmn" (11.10) 


(fmn* stands for (fmn)*)- Such matrices, like the corresponding operators, 
are said to be Hermitian. 

Matrix elements with # = m are called diagonal elements. These are 
independent of time, and (11.10) shows that they are real. The element fnn 
is the mean value of the quantity f in the state Wp 

It is not difficult to obtain the ‘multiplication rule” for matrices. To do 
so, we first observe that the formula 


Nn = E f mnt (11.11) 


holds. This is simply the expansion of the function ff, in terms of the func- 
tions fm, the coefficients being determined in accordance with the general 
formula (3.5). Remembering this formula, let us write down the result of 
the product of two operators acting on the function p,p: 


félin = fl bpn) = f E gine = © gin fhr = X Einf mhm 


Since, on the other hand, we must have 
fen = = Cle) wai ons 


we arrive at the result that the matrix elements of the product fg are deter- 
mined by the formula 
(LE) mn = ES men: (11.12) 


This rule is the same as that used in mathematics for the multiplication of 
matrices: the rows of the first matrix in the product are multiplied by the 
columns of the second matrix. 


If the matrix is given, then so is the operator itself. In particular, if the 
matrix is given, it is in principle possible to determine the eigenvalues of the 
phvsical quantity concerned and the corresponding eigenfunctions. 

We shall now consider the values of all quantities at some definite instant, 
and expand an arbitrary wave function ¥ (at that instant) in terms of the 
eigenfunctions of the Hamiltonian, i.e. of the wave functions ym of the 
stationary states (these wave functions are independent of time). 


Y= Deila (11.13) 


where the expansion coefhcients are denoted by cm. We substitute this expan- 
sion in the equation f¥ = fF which determines the eigenvalues and eigen- 
functions of the quantity f. We have 


E Cm( Mn) =S E Ci 


We multiply both sides of this equation by y,,* and integrate over g. Each 
of the integrals f ,,*fus,, dg on the left-hand side of the equation is the cor- 
responding matrix element fam- On the right-hand side, all the integrals 
f Yn*Ym dg with m # n vanish by virtue of the orthogonality of the functions 
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Ym, and f pn*%n dg = 1 by virtue of their normalization.t Thus 


D famem = fen, (11.14) 


or 
2 (fnm—SPnmdem as 0, 


where 64,,,, = 0 form #nand = 1 form=n. 

Thus we have obtained a system of homogeneous algebraic equations of 
the first degree (with the c,, as unknowns). As is well known, such a system 
has solutions different from zero only if the determinant formed by the 
coefficients in the equations vanishes, i.e. only if 


| fnm—f9 nmol = 0. (11.15) 


The roots of this equation (in which f is regarded as the unknown) are the 
possible values of the quantity f. The set of values c,, satisfying the equations 
(11.14) when f is equal to any of these values determines the corresponding 
eigenfunction. 

If, in the definition (11.5) of the matrix elements of the quantity f, we take 
as ,, the eigenfunctions of this quantity, then from the equation ffn = fyn 
we have 


Sam = EAA dg = fm f Yn bm dg. 


By virtue of the orthogonality and normalization of the functions ym, this 
gives fnm = 0 for n # m and fmm = fm. Thus only the diagonal matrix 
elements are different from zero, and each of these is equal to the correspond- 
ing eigenvalue of the quantity f. A matrix with only these elements different 
from zero is said to be put in diagonal form. In particular, in the usual 
representation, with the wave functions of the stationary states as the functions 
fn, the energy matrix is diagonal (and so are the matrices of all other physical 
quantities having definite values in the stationary states). In general, the 
matrix of a quantity f, defined with respect to the eigenfunctions of some 
operator 8, is said to be the matrix of f in a representation in which g is diagonal. 
We shall always, except where the subject is specially mentioned, understand 
in future by the matrix of a physical quantity its matrix in the usual repre- 
sentation, in which the energy is diagonal. Everything that has been said 
above regarding the dependence of matrices on time refers, of course, only 
to this usual representation. Í 





t In accordance with the general rule ($5), the set of coefficients cn in the expansion (11.13) 
can be considered as the wave function in the “energy representation” (the variable being 
the sufhx n that gives the number of the energy eigenvalue). The matrix fam here acts as 
the operator f in this representation, the action of which on the wave function is given by 
the left-hand side of (11.14) The formula f = LE c¥(fnmcm) then corresponds to the general 
expression for the mean value of a quantity in terms of its operator and the wave function of 
the state concerned. 

Í Bearing in mind the diagonality of the energy matrix, it is easy to see that equation (11.8) 
is the operator relation (9 2) written in matrix form 
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By means of the matrix representation of operators we can prove the 
theorem mentioned in §4: if two operators commute with each other, they 
have their entire sets of eigenfunctions in common. Let f and g be two 
such operators. From ff = gf and the matrix multiplication rule (11.12), 
it follows that 


Z f mr8kn = È 8 mifen- 


If we take the eigenfunctions of the operator f as the set of functions y, with 
respect to which the matrix elements are calculated, we shall have fp, = 0 
for m + k, so that the above equation reduces to fmm8mn = 2mnfanr OF 


Emn fm fn) = 0. 


If all the eigenvalues f„ of the quantity f are different, then for all m 4 n we 
have fm—fn # 0, so that we must have g,,, = 0. Thus the matrix g,,,, is 
also diagonal, i.e. the functions y, are eigenfunctions of the physical quantity 
g also. If, among the values fa, there are some which are equal (i.e. if there 
are eigenvalues to which several different eigenfunctions correspond), then 
the matrix elements g,,,, corresponding to each such group of functions W, 
are, in general, different from zero. However, linear combinations of the 
functions ys, which correspond to a single eigenvalue of the quantity f are 
evidently also eigenfunctions of f, one can always choose these combinations 
in such a way that the corresponding non-diagonal matrix elements g,,,, are 
zero, and thus, in this case also, we obtain a set of functions which are 
simultaneously the eigenfunctions of the operators f and å. 
The following formula is useful.in applications: 


(GH/8\)nn = IBON, (11.16) 


where À is a parameter on which the Hamiltonian A (and therefore the 
energy eigenvalues Ep) depends. It is proved as follows. Differentiating 
the equation (H — En)ẹn = 0 with respect to A and then multiplying on the 
left by %n*, we obtain 


. A AEn ô 
TERO = (= - 5) , 
ua ES, á a an)" 





On integration with respect to q, the left-hand side gives zero, since 





| oar By he e | ZENA- By) dn dg, 


the operator H being Hermitian. The right-hand side gives the required 
equation. 
A widely used notation (introduced by Dirac) in recent literature is that 
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which denotes the matrix elements fnm byt 


<n| f |m). GLI 


This symbol is written so that it may be regarded as “‘consisting” of the 
quantity f and the symbols |m and <n| which respectively stand for the 
initial and final states as such (independently of the representation of the 
wave functions of the states). With the same symbols we can construct 
notations for the expansion coefficients of wave functions: if there is a 
complete set of wave functions corresponding to the states |), |22), .-. , 
the coefficients in the expansion in terms of these of the wave function of a 
state |m> are denoted by 


<nilm> = | dnc bm dg. (11.18) 


§12. Transformation of matrices 


The matrix elements of a given physical quantity can be defined with 
respect to various sets of wave functions, for example the wave functions of 
stationary states described by various sets of physical quantities, or the wave 
functions of stationary states of the same system in various external fields. 
The problem therefore arises of the transformation of matrices from one 
representation to another. 

Let Wn(q) and n'(g) (n = 1, 2,...) be two complete sets of orthonormal 
functions, related by some linear transformation: 


Yn’ = E Smabm, (12.1) 


which is simply an expansion of the function n’ in terms of the complete set 
of functions %. This transformation may be conventionally written in the 
operator form 


Yn’ = Sin. (12.2) 


The operator S must satisfy a certain condition in order that the functions 
Yn should be orthonormal if the functions %, are. Substituting (12.2) in 
the condition 


Í tm *Yin' dg = mn, 
and using the definition of the transposed operator (3.14), we have 


f (Spn) S "pm dq = f pmts Syn dg = Smn- 


If these equations hold for all m and n, we must have $*$ = 1, or 


Se = $+ = §-1, (12.3) - 





——. 


+ Both notations are used in the present book. The form (11.17) is especially convenient 
when each suffix has to be written as several letters. 
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i.e. the inverse operator 1s equal to the Hermitian conjugate operator. 
Operators having this property are said to be umitarv. Owing to this property, 
the transformation Yn = Syn’ inverse to (12.1) is given by 


Yn = = Sam*bm'. (12.4) 

Writing the equations $+S = 1 and SS+ = 1 in matrix form, we obtain 
the following forms of the unitarity condition: 

D Stm* Sin za Ômn, (12:5) 

È Smi* Sn == EPA (12.6) 


Let us now consider some physical quantity f and write down its matrix 
elements in the “new” representation, i.e. with respect to the functions wy’. 
These are given by the integrals 


f Ym” fun’ dg = | (Sem fin) dg 


= | bint S* fSyn dg 


= | hmt S Syn dg. 
Hence we see that the matrix of the operator f in the new representation is 
equal to the matrix of the operator 
fi = S-f$ (12.7) 


in the old representation.t 
The sum of the diagonal elemerts of a matrix is called the trace or spur] 
of the matrix and denoted by tr 7: 


trf = E fan (12.8) 


It may be noted first of all that the trace of a product of two matrices is 
:ndependent of the order of multiplication: 


tr (fg) = tr (gf), (12.9) 
t If {f, 8} = —ifé is the commutation rule for two operators f and £, the transformation 
(12.7) gives {f’, £ } = —ihé’, i.e. the rule is unchanged. We have shown in the footnote 


in §9 that ¢ ıs the quantum analogue of the classical Poisson bracket [f, g]. In classical 
mechanics, however, the Poisson brackets are invariant under canonical transformations of 
the variables (generalized coordinates and momenta); see Mechanics, §45. In this sense we 
can say that unitary transformations in quant..n mechanics play a role analogous to that of 
canonical transformations in classical mecha- ss. 

1 From the German word Spur. The no*i on sp f is also used. The trace can be defined, 
of course, only if the sum over n is convery 


§13 The Heisenberg representation of operators 37 
since the rule of matrix multiplication gives 


tr (fg) = E E fak8kn = È Y 8rnfak = tr (gf). 


Similarly we can easily see that, for a product of several matrices, the trace 
is unaffected by a cyclic permutation of the factors; for example, 


tr (fgh) = tr (hfg) = tr (ghf). (12.10) 


An important property of the trace is that it does not depend on the choice 
of the set of functions with respect to which the matrix elements are defined, 
since 


(tr fy = tr (S-YfS) = mSS Y= tr f. (12.11) 


A unitary transformation leaves unchanged the sum of the squared 
moduli of the functions that are transformed: from (12.6) we have 


2 | pa’? Zhe Skipe Sutyi" = e Prp” òk = X | Pal? (12.12) 
Any unitary operator may be written-as 
§ = ek, (12.13) 
where Ê is an Hermitian operator: since R+ = R, we have 


S+ = p-tR+ = Pe: = S-i, 


The expansion 
F = Sa fS = f+{f iR y+ AiR} iR}+ ... (12.14) 


is easily verified by a direct expansion o‘ the factors exp ( + iÊ) in powers of 
R. This expansion may be useful when Ê is proportional to a small para- 
meter, so that (12.14) becomes an expansion in powers of the parameter. 


§13. The Heisenberg representation of operators 


In the mathematical formalism of quantum mechanics described here, the 
operators corresponding to various physical quantities act on functions of the 
coordinates and do not usually depend explicitly on time. The time depen- 
dence of the mean values of physical quantities is due only to the time 
dependence of the wave function of the state, according to the formula 


A) = [ Y*a, Nf El, t) dg. (13.1) 


The quantum-mechanical treatment can, however, be formulated also in a 
somewhat different but equivalent form, in which the time dependence is 
transferred from the wave functions to the operators. Although we shall not 
use this Heisenberg representation (as opposed to the Schrödinger represen- 
tation) of operators in the present volume, a statement of it is given here with 
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a view to applications in the relativistic theory. 
We define the operator (which is unitary; see (12.13)) 


S = exp[—(i/A)H1), (13.2) 


where Ĥ is the Hamiltonian of the system. By definition, its eigenfunctions 
are the same as those of the operator Ĥ, i.e. the stationary-state wave functions 


Wala), where 
SYn(q) = eM Enthn(q). (13.3) 


Hence it follows that the expansion (10.3) of an arbitrary wave function 
Y in terms of the stationary-state wave functions can be written in the 
operator form 

F(q, t) = S'¥(q, 0), (13.4) 


i.e. the effect of the operator $ is to convert the wave function of the system 
at some initial instant into the wave function at an arbitrary instant. 
Defining, as in (12.7), the time-dependent operator 


f(t) = S-YfS, (13.5) 


we have 
fie) = | Erla OFE (a, 0) dg, (13.6) 


and thus obtain the formula (3.8) for the mean value of the quantity fin a 
form in which the time dependence is entirely transferred to the operator (for 
our definition of an operator rests on formula (3.8)). 

It is evident that the matrix elements of the operator (13.5) with respect 
to the stationary-state wave functions are the same 2s the time-dependent 
matrix elements fnm(t) defined by formula (11.3). 

Finally, differentiating the expression (13.5) with respect to time (assuming 
that the operators f and Ĥ do not themselves involve t), we obtain 


a. gine 5 
Eio = HAJO - JOA (13.7) 


which is similar in form to (9.2) but has a somewhat different significance: 
the expression (9.2) defines the operator f corresponding to the physical 
quantity f, while the left-hand side of equation (13.7) is the time derivative 
of the operator of the quantity f itself. 


§14. The density matrix 

The description of a system by means of a wave function is the most 
complete description possible in quantum mechanics, in the sense indicated 
at the end of §1. 

States that do not allow such a description are encountered if we consider 
a system that is part of a larger closed system. We suppose that the closed 
system as a whole is in some state described by the wave function ¥(q, x), 
where x denotes the set of coordinates of the system considered, and g the 
remaining coordinates of the closed system. This function in general does 
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not fall into a product of functions of x and of g alone, so that the system 
does not have its own wave function.t 

Let f be some physical quantity pertaining to the system considered. Its 
operator therefore acts only on the coordinates x, and not on g. The mean 
value of this quantity in the state considered is 


f= | | E"l, x) f(g, x) dade. (14.1) 
We introduce the function p(x, x’) defined by 
p(x, x") = | ¥(q, x)F*(g, x’) da, (14.2) 


where the integration is extended only over the coordinates g; this function 
is called the density matrix of the system. From the definition (14.2) it is 
evident that the function is “Hermitian”: 


p™(x, x’) = p(x’, x). (14.3) 


The ‘‘diagonal elements” of the density matrix 


p(x, x) = Í l'EF (g, x)|? dg 


determine the probability distribution for the coordinates of the system. 
Using the density matrix, we can write the mean value f in the form 


f = f [fp(x, x')]e:=z dx. (14.4) 


Here f acts only on the variables x in the function p(x, x’); after calculating 
the result of its action, we put x’ = x. We see that, if we know the density 
matrix, we can calculate the mean value of any quantity characterizing the 
system. It follows from this that, by means of p(x, x’), we can also determine 
the probabilities of various values of the physical quantities in the system. 
Thus the state of a system which does not have a wave function can be 
described by means of a density matrix. This does not contain the co- 
ordinates g which do not belong to the system concerned, though, of course, 
it depends essentially on the state of the closed system as a whole. 

The description by means of the density matrix is the most general form 
of quantum-mechanical description of the system. The description by means 
of the wave function, on the other hand, is a particular case of this, cor- 
responding to a density matrix of the form p(x, x’) = ¥(x)¥*(x'). The 
following important difference exists between this particular case and the 
general one.; For a state having a wave function there is always a complete 


t In order that F(q, x) should (at a given instant) fall into such a product, the measurement 
as a result of which this state was brought about must completely describe the system con- 
sidered and the remainder of the closed system separately. In order that ¥(q, x) should 
continue to have this form at subsequent instants, it is necessarv tn addition that these parts of 
the closed system should not interact (see §2). Neither of these conditions 1s now assumed. 

t States having a wave function are called “pure” states, as distinct from ‘‘mixed”’ states, 
which are described by a density matrix. 
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set of measuring processes such that they lead with certainty to definite 
results (mathematically, this means that Y¥ is an eigenfunction of some opera- 
tor). For states having only a density matrix, on the other hand, there is no 
complete set of measuring processes whose result can be uniquely predicted. 

Let us now suppose that the system is closed, or became so at some instant. 
Then we can derive an equation giving the change in the density matrix with 
time, similar to the wave equation for the function. The derivation can be 
simplified by noticing that the required linear differential equation for 
p(x, x’, t) must be satisfied in the particular case where the system has a wave 
function, 1.e. 


Ae x D SIP tN, tye Cet). 


Difterentiating with respect to time and using the wave equation (8.1), we 
have 


W(x, oTa i(x',t 
= ) E(x, a ) 





ORE r ð 
th— = th'¥*(x’', t) 
Ot ot 


= ¥"(x', DAE (x, th—¥ (x, DA AFX, t), 
where H is the Hamiltonian of the system, acting on a function of x, 
and Hf’ is the same operator acting on a function of x’. The functions 


¥*(x', t) and ¥(x, t£) can obviously be placed behind the respective operators 
Ĥ and A’, and we thus obtain the required equation: 


th Gp(x, x’, t)/et = (H—H'*)p(x, x’, t). (14.5) 


Let ’,(x, t) be the wave functions of the stationary states of the system, 
i.e. the eigenfunctions of its Hamiltonian. We expand the density matrix 
in terms of these functions; the expansion consists of a double series in the 
form 

Akt) = Pp? Oe ae AEs, 
= LE a mnYn* (x Yml) PX En Edt, (14.6) 
mn 
For the density matrix, this expansion plays a part analogous to that of the 
expansion (10.3) for wave functions. Instead of the set of coefficients ap, 


we have here the double set of coefficients a,,,. These clearly have the pro- 
perty of being ‘“‘Hermitian’”’, like the density matrix itself: 


Anm” = lmn (14.7) 
For the mean value of some quantity f we have, substituting (14.6) in (14.4), 
f= DT dan Í #(x, DFY a(x, t) dx, 
mnan 


or 
f = EE arnfnm(t) = EE amnfnmeë ME Emt, (reJ 
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where fnm are the matrix elements of the quantity f. This expression is 
similar to formula (11.1).t+ 

The quantities a,,, must satisfy certain inequalities. The ‘diagonal 
elements” p(x, x) of the density matrix, which determine the probability 
distribution for the coordinates, must obviously be positive quantities. It 
therefore follows from the expression (14.6) (with x’ = x) that the quadratic 


form 
pape [see et ae 
nm 


constructed with the coefficients amn (where the €, are arbitrary complex 
quantities) must be positive. This places certain conditions, known from the 
theory of quadratic forms, on the quantities a,m. In particular, all the 
“‘diagonal”’ quantities must clearly be positive: 


ann = Q, (14.9) 
and any three quantities apns, &mm aNd a,,, must satisfy the inequality 
Oi, EAB (14.10) 


To the “‘pure’’ case, where the density matrix reduces to a product of 
functions, there evidently corresponds a matrix amn of the form 


ann = aman. (14.11) 


We shall indicate a simple criterion which enables us to decide, from the 
form of the matrix amn, whether we are concerned with a “pure” or a 
“mixed” state. In the pure case we have 


Ca en = a Amkikn 


Or 


(4) mn = Amn (14.12) 


i.e. the density matrix is equal to its own square. 
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Let us consider a closed system of particles not in an external field. Since 
all positions in space of such a system as a whole are equivalent, we can say, 
in particular, that the Hamiltonian of the system does not vary when the 
system undergoes a parallel displacement over any distance. It is sufficient 
that this condition should be fulfilled for an arbitrary small displacement. 

An infinitely small parallel displacement over a distance òr signifies-a trans- 
formation under which the radius vectors r, of all the particles (a being the 
number of the particle) receive the same increment 6r:r, > r, + ôr. An 


t The quantities amn torm the density matrix in the energy representation. The description 


of the states of a system by means of this matrix was introduced independently by L. Landau 
and F. Bloch in 1927. 
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arbitrary function x(r1, re, ...) of the coordinates of the particles, under 
such a transformation, becomes the function 


P(r, + êr, r+ ôr, ...) = P(r Ke, ...)4+6r. 2 Vay 
= (1+ôr. ~ Vari teres) 


(Vadenotes the operator of differentiation with respect to ra). The expression 
l+ôr. D Va 
a 


is the operator of an infinitely small displacement, which converts the 
function (r1, re, ...) into the function 


P(r, +6r, r.+6r, ... ). 

The statement that some transformation does not change the Hamiltonian 
means that, if we make this transformation on the function Ay, the result is 
the same as if we make it only on the function ys and then apply the operator Ê. 
Mathematically, this can be written as follows. Let O be the operator which 
effects the transformation in question. Then we have Ô( Ay) = A(Oys), whence 


OH—HO = 0, 


i.e. the Hamiltonian must commute with the operator Ô. 

In the case considered, the operator O is the operator of an infinitely 
smali displacement. Since the unit operator (the operator of multiplying 
by unity) commutes, of course, with any operator, and the constant factor òr 
can be taken in front of A, the condition OH — AO = Q0 reduces here to 


(£ V)JH—H(z Va) = 0. (15.1) 


As we know, the commutability of an operator (not containing the time 
explicitly) with H means that the physical quantity corresponding to that 
operator is conserved. The quantity whose conservation for a closed system 
follows from the homogeneity of space is the momentum of the system (cf. 
Mechanics, §7). Thus the relation (15.1) expresses the law of conservation 
of momentum in quantum mechanics; the operator 2 Va must correspond, 
apart from a constant factor, to the total momentum of the system, and 
each term Va of the sum to the momentum of an individual particle. 

The coefficient of proportionality between the operator p of the momentum 
of a particle and the operator V can be determined by means of the passage 
to the limit of classical mechanics, and is —12h: 


p = —ihV, (15.2) 
Or, in components, 
Îa = —iholdx, f, = —ihdjay, f, = —ihd/az. 
Using the limiting expression (6.1) for the wave function, we have 


PYF = -ilih FVS = VS, 
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i.e. in the classical approximation the effect of the operator f reduces to 
multiplication by VS. The gradient VS of the action is the classical 
momentum p of the particle (see Mechanics, §43). 

It is easy to see that the operator (15.2) is Hermitian, as it should be. 
For, with arbitrary functions ¥(x) and ¢(x) which vanish at infinity, we have 


fapa -ih [gdin p dx = | ¥Btd de, 


and this is the condition that the operator should be Hermitian. 

Since the result of differentiating functions with respect to two different 
variables is independent of the order of differentiation, it is clear that the 
operators of the three components of momentum commute with one another: 


Pafs — fap =Q bsp.—Pibx = 0), Pubs— PP v = Q. (15.3) 

This means that all three components of the momentum of a particle can 
simultaneously have definite values. l 

Let us find the eigenfunctions and eigenvalues of the momentum operators. 
They are determined by the vector equation 


~ihVys = py. (15.4) 


The solutions are of the form 
= Cee. (15.5) 


where C is a constant. If all three components of the momentum are given 
simultaneously, we see that this completely determines the wave function 
of the particle. In other words, the quantities pz, Py, Pz form one of the poss- 
ible complete sets of physical quantities for a particle. Their eigenvalues 
form a continuous spectrum extending from — œ to + æ. 

According to the rule (5.4) for normalizing the eigenfunctions of a con- 
tinuous spectrum, the integral f y* p dV taken over all space (dV = dx 
dy dz) must be equal to the delta function 6(p’—p).ft However, for reasons 
that will become clear from subsequent applications, it is more natural to 
normalize the eigenfunctions of the particle momentum by the delta function 
of the momentum difference divided by 27h: 


| bob, dV = 8 (Ey 


| Yo *p AV = (27h)8 8(p’ —p) (15.6) 


or, equivalently, 


(since each of the three factors in the three-dimensional delta function 


is 8[(p'1—px)/27h] = 2rh 8(p’x—px), and so on). 


+ The three-dimensional function 6(a) of a vector a ts defined as a product of delta functions 
of the components of the vector a: 5(a) = 6(az)8(ay) (az). 
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The integration is effected by means of the formulat 


E f eft dé = 6(a). (15.7) 


This shows that the constant in (15.5) is equal to unity if the normalization 
is according to (15.6):] 
pp = einer, (15.8) 


The expansion of an arbitrary wave function ẹ(r) of a particle in terms 
of the eigenfunctions 9, of its momentum operator is simply the expansion 
as a Fourier integral: 


d3 
We) = | ap ole) 


P a fi p-r d3p 
ins = | a(p)eʻi/he" —— (15.9) 





(27 (2rħ)3 


(where dp = dp,dp,dp,). The expansion coefficients a(p) are, according 
to formula (5.3), 


alp) = | grp") AV = f re-te dV. (15.10). 


The function a(p) can be regarded (see §5) as the wave function of the 
particle in the “momentum representation” ; |a(p)|* d?p/(2rA)? is the proba- 
bility that the momentum has a value in the interval dp. 

Just as the operator f corresponds to the momentum, determining its 
eigenfunctions in the coordinate representation, we can introduce the 
operator r of the coordinates of the particle in the momentum represen- 
tation. It must be defined so that the mean value of the coordinates can be 
represented in the form 





r= | errr) <P (15.11) 


On the other hand, this mean value is determined from the wave function 
(r) by 
r= f y*ry dV, 
t The conventional meaning of this formula is that the function on the left-hand side has 


the property (5.8) of the delta function. Substituting 6(x—a) in the form (15.7), we obtain 
from (5.8) the well-known Fourier integral formula 


f(a) = | ff (xel-) dx dé/2n 


t Note that with this normalization the probability density |4|? = 1, i.e. the function is 
normalized to "one particle per unit volume”. This agreement of normalizations is, of 
course, no accident; see the last footnote to §48. 
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Substituting (r) in the form (15.9) we have (integrating by parts) 


ryi(r) = (2nh)-3 | ra(p)etmP* d3p 
= (2nh)-9{ ihe#/™*[2a(p)/Op] dp. 
Using this expression and (15.10), we find 


F = (2nh)-3 Í Í b*(x)ih[Aa(p)/dp jet m" d3pdV 





— d3p 
= | thar (pileatp) pi Fe 


Comparing with (15.11), we see that the radius vector operator in the 
momentum representation 1s 
ê = thd/dp. (15.12) 


The momentum operator in this representation reduces simply to multipli- 
cation by p. 

Finally, we shall express in terms of p the operator of a parallel displace- 
ment in space over any finite (not only infinitesimal) distance a. By the 
definition of this operator (7) we must have 


Papir) = Y(r+a). 
Expanding the function (r+ a) in a Taylor series, we have 
y(r+a) = y(r)+a . ay(r)/art ..., 


or, introducing the operator p = —zhV, 


y(r+a) = E +a 6+5(52 : p+ aS |e. 


The expression in brackets is the operator 
Ta = et -B, (15.13) 


This is the required operator of the finite displacement. 
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Let us derive the rules for commutation between momentum and co- 
ordinate operators. Since the result of successively differentiating with 
respect to one of the variables x, y, z and multiplying by another of them 
does not depend on the order of these operations, we have 


f-y—yfz =0, Prz—zp, = 0, (16.1) 


and similarly for f}, fz 
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To derive the commutation rule for f, and x, we write 


(Pax— xp) = —ih O(mp)/Ox+ thx op/dx 
= —thyp. 


We see that the result of the action of the operator 6,x—xp, reduces to 
multiplication by —zH; the same is true, of course, of the commutation of Íy 
with y and f, with z. Thus we havet 


fax— xfa = —iħ, pyy-ypy = —th, p,2—2p, = —ih. (16.2) 
All the relations (16.1) and (16.2) can be written jointly in the form 
Pixu — xpi = — ilb (i,k = x,y, 2). (16.3) 


Before going on to examine the physical significance of these relations and 
their consequences, we shall set down two formulae which will be useful 
later. Let f(r) be some function of the coordinates. Then 


ĤÊf(r)— f) = —ih Vf. (16.4) 
For 
(SÊ) = —iR V (SHS VE] = igi: 


A similar relation holds for the commutator of r with a function of the 
momentum operator: 


f{(b)r—rf(b) = —theflop. (16.5) 


It can be derived in the same way as (16.4) if we calculate in the momentum 
representation, using the expression (15.12) for the coordinate operators. 

The relations (16.1) and (16.2) show that the coordinate of a particle along 
one of the axes can have a definite value at the same time as the components 
of the momentum along the other two axes; the coordinate and momentum 
component along the same axis, however, cannot exist simultaneously. In 
particular, the particle cannot be at a definite point in space and at the same 
time have a definite momentum p. 

Let us suppose that the particle is in some finite region of space, whose 
dimensions along the three axes are (of the order of magnitude of) Ax, Ay, Az. 
Also, let the mean value of the momentum of the particle be py. Mathe- 
matically, this means that the wave function has the form $ = u(r)jel/MPot, 
where u(r) is a function which differs considerably from zero only in the 
region of space concerned. We expand the function %4 in terms of the eigen- 
functions of the momentum operator (i.e. as a Fourier integral). The co- 
efficients a(p) in this expansion are determined by the integrals (15.10) of 
functions of the form u(r)e/™ Pt, If this integral is to differ consider- 
ably from zero, the periods of the oscillatory factor e('/™@rP)* must not be 
small in comparison with the dimensions Ax, Ay, Az of the region in which 
the function u(r) is different from zero. This means that a(p) will be con- 


+ These relations, discovered in matrix form by Heisenberg in 1925, formed the genesis of 
quantum mechanics. 
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siderably different from zero only for values of p such that (1/4)(p)-—p,)Ax S 
1, etc. Since |a(p)|? determines the probability of the various values of the 
momentum, the ranges of values of p,, £y, P, 1n which a(p) differs from zero 
are just those in which the components of the momentum of the particle may 
be found, in the state considered. Denoting these ranges by AP z, Ap,, Ap,» 
we thus have 


Ap,Ax ~h, Ap Ay ~i, Ap,Az ~À. (16.6) 


These relations, known as the uncertainty relations, were obtained by 
Heisenberg in 1927. 

We see that, the greater the accuracy with which the coordinate of the 
particle is known (i.e. the less Ax), the greater the uncertainty Ap, in the 
component of the momentum along the same axis, and vice versa. In parti- 
cular, if the particle is at some completely definite point in space (Ax = 
Ay = Az = 0), then Ap, = Ap, = Ap, = œ. This means that all values 
of the momentum are equally probable. Conversely, if the particle has a 
completely definite momentum p, then all positions of it in space are equally 
probable (this is seen directly from the wave function (15.8), whose squared 
modulus is quite independent of the coordinates). 

If the uncertainties of the coordinates and momenta are specified by the 
standard deviations 


8x = 4/[(x—x)2], Sp2 = v[(pz- pa, 


we can specify exactly the least possible value of their product (H. Weyl). Let 
us consider the one-dimensional case of a wave packet with wave function 
W(x) depending on only one coordinate, and assume for simplicity that the 
mean values of x and pz in this state are zero. We consider the obvious 


inequality 
a dup 
Jes 


where a is an arbitrary real constant. On calculating this integral, noticing 
that 


2 
dx > 0, 





| yt ax = (Bey, 








dy* d dhl? 
| (« y+ xy*— ) dx = | x dx = — | [yb]? dx = —1, 
J dx dx Jo sae 


La 





dý* dy ae 1 E 7 i l 
f ma R fv Eda = = | WB dx = (89a), 


we obtain 


a*(dx)*—a+(1/h?)(dpz)? 2 0. 
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If this quadratic (in ) trinomial is positive for all «, its discriminant must be 
negative, which gives the inequality 


Ox opz 2 Lh. (16.7) 


The least possible value of the product is $4, and occurs for wave packets with 
wave functions of the form 


uh (16.8) 


1 si b z x? ) 
= = M —= 0 —_ m ’ 
(QnyW4 «/(5x) G 4(8x)2 
where po and ôx are constants. The probabilities of the various values of the 

coordinates in such a state are 


3 1 w 

l? = ———— exp [ ==; } 

o/(27) . dx 2(6x)? 

and thus have a Gaussian distribution about the origin (the mean value ¥ = 0) 
with standard deviation 6x. The wave function in the momentum represen- 
tation 1s 


Y(x)e—/Mpse dx. 


a( pz) = 


g mmg 


Calculation of the integral gives 


a(pz) = constant x exp — 


The distribution of probabilities of values of the momentum, |a(pz)|?, is also 
Gaussian about the mean value z = po, with standard deviation ôpr = 
h/28x, so that the product 4pz6x is indeed $f. 

Finally, we shall derive another useful relation. Let fand g be two physical 
quantities whose operators obey the commutation rule 


jé—if = —ihé, (16:9) 


where ¢ is the operator of some physical quantity c. On the right-hand side 
of the equation the factor fi is introduced in accordance with the fact that in 
the classical limit (i.e. as A — 0) all operators of physical quantities reduce 
to multiplication by these quantities and commute with one another. Thus, 
in the ‘‘quasi-classical”’ case, we can, to a first approximation, regard the right- 
hand side of equation (16.9) as being zero. In the next approximation, the 
operator ¢ can be replaced by the operator of simple multiplication by the 
quantity c. We then have 
fe—sf = —ihe 


This equation is exactly analogous to the relation £,.x—xp, = —ih, the only 
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difference being that, instead of the constant f, we havet the quantity Ac. 
We can therefore conclude, by analogy with the relation AxAp, ~ A, that 
in the quasi-classical case there is an uncertainty relation 


AfAg ~ he (16.10) 


for the quantities f and g. 

In particular, if one of these quantities is the energy (f=H) and the 
operator (f) of the other does not depend explicitly on the time, then by 
(9.2) c = g, and the uncertainty relation in the quasi-classical case is 


AEAg ~ hg. (16.11) 


+ The classical quantity c is the Poisson bracket of the quantities f and g; see the footnote 
in §9. 


CHAPTER II 


SCHRODINGER’S EQUATION 


§17. Schrodinger’s equation 

THE form of the wave equation of a physical system is determined by its 
Hamiltonian, which is therefore of fundamental significance in the whole 
mathematical formalism of quantum mechanics. 

The form of the Hamiltonian for a free particle is established by the 
general requirements imposed by the homogeneity and isotropy of space and 
by Galileo’s relativity principle. In classical mechanics, these requirements 
lead to a quadratic dependence of the energy of the particle on its momentum: 
E = p*j2m, where the constant m is called the mass of the particle (see 
Mechanics, §4). In quantum mechanics, the same requirements lead to a 
corresponding relation for the energy and momentum eigenvalues, these 
quantities being conserved and simultaneously measurable (for a free 
particle). 

If the relation E = p?/2m holds for every eigenvalue of the energy and 
momentum, the same relation must hold for their operators also: 


H = (1/2m)((2+6,2+,2). (17.1) 


Substituting here from (15.2), we obtain the Hamiltonian of a freely moving 
particle in the form 


H = —(#/2m)A, (17.2) 


where A = 6?/0x*+ 0?/dy?+40?/02 is the Laplacian operator. 
The Hamiltonian of a system of non-interacting particles 1s equal to the 
sum of the Hamiltonians of the separate particles: 


H = —}h? X(1/m,)A, (17.3) 


(the suffix a is the number of the particle; A, is the Laplacian operator in 
which the differentiation is with respect to the coordinates of the ath particle). 

In classical (non-relativistic) mechanics, the interaction of particles is 
described by an additive term in the Hamiltonian, the potential energy of the 
interaction U(ri, re, ...), which is a function of the coordinates of the particles. 
By adding a similar function to the Hamiltonian of the system, the interaction 
of particles can be represented in quantum mechanics :t 


A = 4} E Amt U(r, te ---)- (17.4) 


+ This statement is, of course, not a logical consequence of the basic principles of quantum 
mechanics, and is to be regarded as a deduction from experiment. 
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The first term can be regarded as the operator of the kinetic energy and the 
second as that of the potential energy. In particular, the Hamiltonian for a 
single particle in an external feld is 


H = p?/2m4U(x,y, 2) = —(#2/2m)4 + U(x, y, 2), (17.5) 


where U(x, y, z) is the potential energy of the particle in the external field. 

Substituting the expressions (17.2) to (17.5) in the general equation (8.1), 
we obtain the wave equations for the corresponding systems. We shall write 
out here the wave equation for a particle in an external field: 


ih OV jot = —(h?/2m)A¥ + U(x, y, 2)¥. (17.6) 
The equation (10.2), which determines the. stationary states, takes the form 
(A? /2m)Ay +[E— U(x, y, 2) ]p = 0. (17.7) 


The equations (17.6) and (17.7) were obtained by Schrédinger in 1926 and 
are called Schrédinger’s equations. 
For a free particle, equation (17.7) has the form 


(H2/2m) Ap + Ey = 0. (17.8) 


This equation has solutions finite in all space for any positive value of the 
energy E. For states with definite directions of motion, these solutions are 
eigenfunctions of the momentum operator, with E = p?/2m. The complete 
(time-dependent) wave functions of such stationary states are 


Y = constant x e-@/MEHG /n)p-r (17.9) 


Each such function, a plane wave, describes a state in which the particle has a 
definite energy E and momentum p. The angular frequency of this wave is 
Ejk and its wave vector k = p/A; the corresponding wavelength 2z/i/p is 
called the de Broglie wavelength ot the particle. 

The energy spectrum of a freely moving particle is thus found to be con- 
tinuous, extending from zero to +00. Each of these eigenvalues (except 
E = 0) is degenerate, and the degeneracy is infinite. For there corresponds 
to every value of E, different from zero, an infinite number of eigenfunctions 
(17.9), differing in the direction of the vector p, which has a constant absolute 
magnitude. 

Let us enquire how the passage to the limit of classical mechanics occurs 
in Schrédinger’s equation, considering for simplicity only a single particle 
in an external field. Substituting in Schrédinger’s equation (17.6) the limit- 
ing expression (6.1) for the wave function, ¥ = ae(*/™)S, we obtain, on per- 
forming the differentiation, 


oS Pe a ys) th Re ih Sane RA 7 
TOET | a alle aes —— > va — a=0. 
É ot ot tom ane m 2m oa: 


t The idea of a wave related to a particle was first introduced by L. de Broglie in 1924. 
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In this equation there are purely real and purely imaginary terms (we recall 
that S and a are real); equating each separately to zero, we obtain two 
equations 





aS 1 JS}+U h? ik 
ee e = a> A 
ot taa 2ma 
ða a l 
—+—ÂS+— VS. Va =0. 
ot 2m m 
Neglecting the term containing Å? in the first of these equations, we obtain 
oS ıl 
-4 Us) 0, (17.10) 
ot 2m 


that is; the classical Hamilton-Jacobi equation for the action S of a particle, 
as it should be. We see, incidentally, that, as A — 0, classical mechanics 1s 
valid as far as quantities of the first (and not only the zero) order in /# inclusive. 

The second equation obtained above, on multiplication by 2a, can be re- 


written in the form 
Qa? VS 
—+div( a ) = 0. (17.11) 
t 


m 





This equation has an obvious physical meaning: a? is the probability density 
for finding the particle at some point in space (||? = a2); VS/m = p/m 
is the classical velocity v of the particle. Hence equation (17.11) is simply 
the equation of continuity, which shows that the probability density ‘‘moves”’ 
according to the laws of classical mechanics with the classical velocity v at 
every point. 


PROBLEM 
Find the transformation law for the wave function in a Galilean transformation. 
SOLUTION. Let us apply the transformation to the wave function for free motion of a 
particle (a plane wave). Since any function VY can be expanded in plane waves, this will also 
give the transformation law for any wave function. 
The plane waves in the frames of reference K and K’ (K’ moving with velocity V relative 
to K) are 


Y(r, t) = constant x eft/#4P-"-E 0), 
¥"(r', 1) = constant x glia P = -E 0, 
where r = r’+ Vz; the particle momenta and energies in the two frames are related by 
p =p tm, E = E'+V.p'+4mV2 
(see Mechanics, §8). Substitution of these expressions in Y gives 


Tr D= OF ee iexp i (mV.r' + imre) | 


= ¥'(r—Vt, t) exp E (mV. — bm) |; (1) 


This formula does not contain the parameters of the free motion of the particle, and gives the 
required general transformation law for the wave function of any state of the particle. Fora 
system of particles, the exponent in (1) contains a summation over the particles. 
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§18. The fundamental properties of Schrodinger’s equation 


The conditions which must be satisfied by solutions of Schrédinger’s 
equation are very general in character. First of all, the wave function must 
be single-valued and continuous in all space. The requirement of continuity 
is maintained even in cases where the field U(x, y, 2) itself has a surface of 
discontinuity. At such a surface both the wave function and its derivatives 
must remain continuous. The continuity of the derivatives, however, does 
not hold if there is some surface beyond which the potential energy U becomes 
infinite. A particle cannot penetrate at all into a region of space where 
U = œ, i.e. we must have y = 0 everywhere in this region. The continuity 
of ys means that ys vanishes at the boundary of this region; the derivatives of 
ys, however, in general are discontinuous in this case. 

If the field U(x, y, z) nowhere becomes infinite, then the wave function 
also must be finite in all space. The same condition must hold in cases where 
U becomes infinite at some point but does so only as 1/r* with s < 2 (see 
also §35). 

Let Umin be the least value of the function U(x, y, z). Since the Hamil- 
tonian of a particle is the sum of two terms, the operators of the kinetic 
energy (7) and of the potential energy, the mean value £ of the energy in any 
state is equal to the sum 7+U. But all the eigenvalues of the operator T 
(which is the Hamiltonian of a free particle) are positive; hence the mean 
value 7 > 0. Recalling also the obvious inequality U > U min we find that 
E> Uin Since this inequality holds for any state, it is clear that it ts valid 
for all the eigenvalues of the energy: 


En > Onin: (18.1) 


Let us consider a particle moving in an external field which vanishes at 
infinity; we define the function U(x, y, 2), in the usual way, so that it vanishes 
at infinity. It is easy to see that the spectrum of negative eigenvalues of the 
energy will then be discrete, i.e. all states with E < 0 in a field which vanishes 
at infinity are bound states. For, in the stationary states of a continuous 
spectrum, which correspond to infinite motion, the particle reaches infinity 
(see §10); however, at sufficiently large distances the field may be neglected, 
the motion of the particle may be regarded as free, and the energy of a freely 
moving particle can only be positive. 

The positive eigenvalues, on the other hand, form a continuous spectrum 
and correspond to an infinite motion; for E > 0, Schrédinger’s equation 
in general has no solutions (in the field concerned) for which the integral 
f |b|? dV converges.t 

Attention must be drawn to the fact that, in quantum mechanics, a particle 
in a finite motion may be found in those regions of space where E < U; 
the probability |x|? of finding the particle tends rapidly to zero as the distance 
into such a region increases, yet it differs from zero at all finite distances. 


+ However, it must be mentioned that, for some particular mathematical forms of the 
function U(x, y, z) (which have no physical significance), a discrete set of values may be 
absent from the otherwise continuous spectrum. 
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Here there is a fundamental difference from classical mechanics, in which 
a particle cannot penetrate into a region where U > E. In classical mechanics 
the impossibility of penetrating into this region is related to the fact that, 
for E < U, the kinetic energy would be negative, that is, the velocity would 
be imaginary. In quantum mechanics, the eigenvalues of the kinetic energy 
are likewise positive; nevertheless, we do not reach a contradiction here, 
since, if by a process of measurement a particle is localized at some definite 
point of space, the state of the particle is changed, as a result of this process, 
in such a way that it ceases in general to have any definite kinetic energy. 
If U(x, y, z) > Oin all space (and U —> Oat infinity), then, by the inequality 
(18.1), we have E, > 0. Since, on the other hand, for E > 0 the spectrum 
must be continuous, we conclude that, in this case, the discrete spectrum 
is absent altogether, i.e. only an infinite motion of the particle is possible. 
Let us suppose that, at some point (which we take as origin), U tends to 
— oo in the manner 
U x —or- (x > 0). (18.2) 


We consider a wave function finite in some small region (of radius rọ) about 
the origin, and equal to zero outside this region. The uncertainty in the 
values of the coordinates of a particle in such a wave packet is of the order 
of rọ; hence the uncertainty in the value of the momentum is ~ fi/ro. The - 
mean value of the kinetic energy in this state is of the order of /i?/mr,?, and 
the mean value of the potential energy is ~ —a/r)*. Let us first suppose 
that s > 2. Then the sum 
hi2/mr,?—«a/r¢° 


takes arbitrarily large negative values for sufficiently small rọ. If, however, 
the mean energy can take such values, this always means that the energy has 
negative eigenvalues which are arbitrarily large in absolute value. The mo- 
tion of the particle in a very small region of space near the origin corresponds 
to the energy levels with large |E]. The “normal” state corresponds to a 
particle at the origin itself, i.e. the particle “falls” to the point r = 0. 

If, however, s < 2, the energy cannot take arbitrarily large negative values. 
The discrete spectrum begins at some finite negative value. In this case the 
particle does not fall to the centre. It should be mentioned that, in classical 
mechanics, the fall of a particle to the centre would be possible in principle 
in any attractive field (i.e. for any positive s). The case s = 2 will be specially 
considered in §35. 

Next, let us investigate how the nature of the energy spectrum depends on 
the behaviour of the field at large distances. We suppose that, as r > ©, 
the potential energy, which is negative, tends to zero according to the power 
law (18.2) (r is now large in this formula), and consider a wave packet “‘filling” 
a spherical shell of large radius rọ and thickness Ar & rọ Then the order 
of magnitude of the kinetic energy is again #?/m(Ar)*, and of the potential 
energy, —a/ro’. We increase ro, at the same time increasing Ar, in such a 
way that Ar increases proportionally to rọ. Ifs < 2, then the sum H?/m(Ar)*— 
ajr,’ becomes negative for sufficiently large 79. Hence it follows that there 
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are stationary states of negative energy, in which the particle may be found, 
with a fair probability, at large distances from the origin. This, however, 
means that there are levels of arbitrarily small negative energy (it must be 
recalled that the wave functions rapidly tend to zero in the region of space 
where U > E). Thus, in this case, the discrete spectrum contains an infinite 
number of levels, which become denser and denser towards the level E = 0. 

If the field diminishes as —1/r* at infinity, with s > 2, then there are not 
levels of arbitrarily small negative energy. The discrete spectrum terminates 
at a level with a non-zero absolute value, so that the total number of levels is 
finite. 

Schrédinger’s equation for the wave functions y of stationary states is real, 
as are the conditions imposed on its solution. Hence its solutions can always 
be taken as real.t The eigenfunctions of non-degenerate values of the energy 
are automatically real, apart from the unimportant phase factor. For y* 
satisfies the same equation as ys, and therefore must also be an eigenfunction 
for the same value of the energy; hence, if this value is not degenerate, ¥ and 
us* must be essentially the same, i.e. they can differ only by a constant factor 
(of modulus unity). The wave functions corresponding to the same de- 
generate energy level need not be real, however, but by a suitable choice of 
linear combinations of them we can always obtain a set of real functions. 

The complete (time-dependent) wave functions ¥" are determined by an 
equation in whose coefficients z appears. This equation, however, retains the 
same form if we replace t in it by — t and at the same time take the complex 
conjugate.{ Hence we can always choose the functions ‘¥ in such a way that 
Y and ¥* differ only by the sign of the time. 

As is well known, the equations of classical mechanics are unchanged by 
time reversal, i.e. when the sign of the time is reversed. In quantum mechanics, 
the symmetry with respect to the two directions of time is expressed, as we 
see, in the invariance of the wave equation when the sign of t is changed and 
Y is simultaneously replaced by Y*. However, it must be recalled that this 
symmetry here relates only to the equation, and not to the concept of 
measurement itself, which plays a fundamental part in quantum mechanics 
(as we have explained in detail in §7). 


§19. The current density 


In classical mechanics the velocity v of a particle is related to its momentum 
by p = mv. A similar relation holds between the corresponding operators 
in quantum mechanics, as we should expect. This is easily shown by cal- 
culating the operator ¥ = r by the general rule (9.2) for the differentiation 
of operators with respect to time: 


% = (i/h)(Hr—rH). 
t These assertions are not valid for systems in a magnetic field 


t It is assumed that the potential energy U does not depend explicitly on the time: the 
system is either closed or in a constant (non-magnetic) field. 
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Using the expression (17.5) for H and formula (16.5), we obtain 


è = fim. (19.1) 


Similar relations will clearly hold between the eigenvalues of the velocity 
and momentum, and between their mean values in any state. 

The velocity, like the momentum of a particle, cannot have a definite value 
simultaneously with the coordinates. But the velocity multiplied by an 
infinitely short time interval dt gives the displacement of the particle in the 
time dt. Hence the fact that the velocity cannot exist at the same time as 
the coordinates means that, if the particle is at a definite point in space at 
some instant, it has no definite position at an infinitely close subsequent 
instant. 

We may notice a useful formula for the operator f of the derivative, with 
respect to time, of some quantity f(r) which 1s a function of the radius vector 
of the particle. Bearing in mind that f commutes with U(r), we find 


f = (i/h\(Af—fH) = (1'2mhy -f P’). 
Using (16.4), we can write 
Pf =P . (fP—hV/f). 
Ip = (Pf +ihVf).B. 
Thus we obtain the required expression: 
f= (1/2m\(p . VJ+ VF -B). (19.2) 
Next, let us find the acceleration operator. We have 
$ = (i/h)(A% — 6H) = (i/mh)\(A p — PA) = (i!mh)(Up — PU) 
Using formula (16.4), we find 
my = —VU. (19.3) 


This operator equation is exactly the same in form as the equation of motion 
(Newton’s equation) in classical mechanics. 

The integral f |¥ |? dV’, taken over some finite volume IV, is the probability 
of finding the particle in this volume. Let us calculate the derivative of this 
probability with respect to time. We have 


d ar» DF i 
Fa [ rp dV = { (Ve dV = - i (F A*r" —Y* AY) dV. 
di., ot Ot h 


Substituting here 
A = At = —(}?/2m) A + U(x, y, 2) 
and using the identity 


PAYV*S—P* AY = div (¥VYF—f* Vy), 
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we obtain d 
£ | rev = -= favj dV, 


where j denotes the vectort 
j = (ih[/2m (YV Y* —-Y*V Y). (19.4) 
= L (ppry* + yep). 
2m 


The integral of div j can be transformed by Gauss’s theorem into an integral 
over the closed surface which bounds the volume V: 


d s 
ps f |¥]2? dV = — $i saf. (19.5) 


It is seen from this that the vector j may be called the probability current density 
vector, or simply the current density. The integral of this vector over a surface 
is the probability that the particle- will cross the surface during unit time. 
The vector j and the probability density |¥]|? satisfy the equation 


a\¥'|2/at +div j = 0, (19.6) 


which is analogous to the classical equation of continuity. 

The wave function of free motion (the plane wave (17.9)) can be normalized 
so as to describe a flow of particles with unit current density (in which, on 
average, one particle crosses a unit cross-section of the flow per unit time). 
This function is then 


Wr e 45-08 (Et-P-1). (19.7) 


fv 


where v is the velocity of the particle, since substitution of this in (19.4) gives 
j = pjmv, i.e. a unit vector in the direction of the motion. 

It is useful to show how the orthogonality of the wave functions of 
states with different energies follows immediately from Schrédinger’s 
equation. Let Ym and ¥,, be two such functions; they satisfy the equations 


—(F2/2m) Ada Ob m =e En ms 
—(h?/2m) Apn" + Upan” = ninr. 


We multiply the first of these by ¥,,* and the second by Ym and subtract 
corresponding terms; this gives 


(Em— En) tmpat = (4/2) Apn —Yn* Op m) 
= (f?/2m) div (Y mV Pnt —¥n* Vtm). 


t If 4 is written as {y)|e?, then 
j = (him)? V2. (19.4a) 
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If we now integrate both sides of this equation over all space, the right-hand 
side, on transformation by Gauss’s theorem, reduces to zero, and we obtain 


(Em En) | tain” AV = 0, 


whence, by the hypothesis E,, # Ep, there follows the required orthogonality 
relation 


{ Umba® dV = 0. 


§20. The variational principle 


Schrédinger’s equation, in the general form Hy = Ey, can be obtained 
from. the variational principle 


§ { *(H—E)y dg = 0. (20.1) 


Since ys is complex, we can vary y and %* independently. Varying %*, we 
have 


f ayr A— E dg = 0, 


whence, because 5y* is arbitrary, we obtain the required equation Ay = Ey. 
The variation of % gives nothing different. For, varying % and using the 
fact that the operator Ĥ is Hermitian, we have 


f A-E) ag = | 54(A*—E\y* dg = 0, 


from which we obtain the complex conjugate equation A*y* = Ey*. 

The variational principle (20.1) requires an unconditional extremum of 
the integral. It can be stated in a different form by regarding E as a Lagran- 
gian multiplier in a problem with the conditional extremum requirement 


8 l yy dg = 0, (20.2) 
the condition being 
Í Yy" dg = 1. (20.3) 


The least value of the integral in (20.2) (with the condition (20.3)) is the first 
eigenvalue of the energy, i.e. the energy E, of the normal state. ‘The func- 
tion ys which gives this minimum is accordingly the wave function yo of the 
normal state.t The wave functions Y, (n > 0) of the other stationary states 
correspond only to an extremum, and not to a true minimum of the integral. 

In order to obtain, from the condition that the integral in (20.2) is a mini- 
mum, the wave function y1 and the energy Ei of the state next to the normal 
one, we must restrict our choice to those functions ¢ which satisfy not only the 


+ In the rest of this section we shall suppose the wave functions y¢ to be real; they can 
always be so chosen (if there is no magnetic field). 
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normalization condition (20.3) but also the condition of orthogonality with 
the wave function pọ of the normal state: f db, dg=0. In general, if the 
wave functions ho, Yy» ..., Yn Of the first n states (arranged in order of in- 
creasing energy) are known, the wave function of the next state gives a mini- 
mum of the integral in (20.2) with the additional conditions 


[Pdg=1, [eymdg=O0  (m=0,1,2,...,2—1). (20.4) 


We shall give here some general theorems which can be proved from the 
variational principle. t 

The wave function yọ of the normal state does not become zero (or, as we 
say, has no nodes) for any finite values of the coordinates.[ In other words, 
it has the same sign in all space. Hence, it follows that the wave functions 
Ya (n > 0) of the other stationary states, being orthogonal to Yo, must have 
nodes (if ,, is also of constant sign, the integral f Yon dg cannot vanish). 

Next, from the fact that W, has no nodes, it follows that the normal energy 
level cannot be degenerate. For, suppose the contrary to be true, and let 
Wo, Yo be two different eigenfunctions corresponding to the level Ey. Any 
linear combination cyig+c’ys, will also be an eigenfunction; but by choosing 
the appropriate constants c, c’, we can always make this function vanish at 
any given point in space, 1.e. we can obtain an eigenfunction with nodes. 

If the motion takes place in a bounded region of space, we must have 
ys = 0 at the boundary of this region (see §18). To determine the energy 
levels, it is necessary to find, from the variational principle, the minimum 
value of the integral in (20.2) with this boundary condition. The theorem that 
the wave function of the normal state has no nodes means in this case that 
Pa does not vanish anywhere inside this region. 

We notice that, as the dimensions of the region containing the motion 
increase, all the energy levels E,, decrease; this follows immediately from 
the fact that an extension of the region increases the range of functions which 
can make the integral a minimum, and consequently the least value of the 
integral can only diminish. 

The expression 


f bd dg = | [— E (2/2malyAayl-+ Uy] dg 


for the states of the discrete spectrum of a particle system may be transformed 
into another expression which 1s more convenient in practice. In the first 
term of the integrand we write 


eA ati = diva} Vat) — (Vap). 


+ The proof of theorems concerning the zeros of eigenfunctions (see also §21) is given by 
M. A. Lavrent’ev and L. A. Lyusternik, The Calculus of Variations (Kurs variatstonnogo 
ischislentya), 2nd edition, chapter IX, Moscow, 1950; R. Courant and D. Hilbert, Wethods of 
Mathematical Physics, volume I, chapter VI, Interscience, New York, 1953. 

t This theorem and its consequences are not in general valid for the wave functions of 
systems consisting of several identical particles (see the end of §63). 
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The integral of diva (Y Vays) over all space is transformed into an integral 
over an infinitely distant closed surface, and since the wave functions of the 
states of a discrete spectrum tend to zero sufficiently rapidly at infinity, this 
integral vanishes. Thus 


| YAy dg = [ [E (42/2) Va)? + UY?] da. (20.5) 


§21. General properties of motion in one dimension 


If the potential energy of a particle depends on only one coordinate (x), 
then the wave function can be sought as the product of a function of y and z 
and a function of x only. The former of these is determined by Schrédinger’s 
equation for free motion, and the second by the one-dimensional Schrédin- 
ger’s equation 


days ao p = 
ot lE— Ute) = 0. (21.1) 


Similar one-dimensional equations are evidently obtained for the problem of 
motion in a field whose potential energy is U(x, y, z) = Ui(x)+ Ue(y)+ Us(z), 
i.e. can be written as a sum of functions each of which depends on only one 
of the coordinates. In §§22—24 we shall discuss a number of actual examples 
of such “one-dimensional”? motion. Here we shall obtain some genera] 
properties of the motion. 

We shall show first of all that, in a one-dimensional problem, none of the 
energy levels of a discrete spectrum is degenerate. To prove this, suppose 
the contrary to be true, and let y, and y, be two different eigenfunctions 
corresponding to the same value of the energy. Since both of these satisfy 
the same equation (21.1), we have 


pi” h = (2m/h?)(U—E) = "be, 


or oye — yip" = 0 (the prime denotes differentiation with respect to x). 
Integrating this relation, we find 
Uy Pa— pipe = constant. (21.2) 


Since y = y = 0 at infinity, the constant must be zero, and so 


p pe Pape’ =0, 


or pi'i = pa'p Integrating again, we obtain ¥, = constant x yx, i.e. the 
two functions are essentially identical. 

The following theorem (called the oscillation theorem) may be stated for the 
wave functions y,,(x) of a discrete spectrum. The function p(x) correspond- 
ing to the (n+1)th eigenvalue E, (the eigenvalues being arranged in order of 
magnitude), vanishes n times (for finitet values of x). 


+ If the particle can be found only on a limited segment of the x-axis, we must consider 
the zeros of n(x) within that segment. 
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We shall suppose that the function U(x) tends to finite limiting values as 
x > +œ (though it need not be a monotonic function). We take the limiting 
value U(+ œ) as the zero of energy (i.e. we put U(+ 00) = 0), and we denote 
U(— œ) by U,, supposing that Us > 0. The discrete spectrum lies in the 
range of energy values for which the particle cannot move off to infinity; for 
this to be so, the energy must be less than both limiting values U(+ œ), 
i.e. it must be negative: 


E <0, (21.3) 


and we must, of course, have in any case E > U minw 
must have at least one minimum with Umin < 0. 
Let us now consider the range of positive energy values less than Uo: 


i.e. the function U(x) 


O< E<U, (21.4) 


In this range the spectrum will be continuous, and the motion of the particle 
in the corresponding stationary states will be infinite, the particle moving off 
towards x = +0. Itis easy to see that none of the eigenvalues of the energy 
in this part of the spectrum is degenerate either. To show this, it is sufficient 
to notice that the proof given above (for the discrete spectrum) still holds if 
the functions y, Y are zero at only one infinity (in the present case they tend 
to zero as x > — œ). 

For sufficiently large positive values of x, we can neglect U(x) in Schrö- 
dinger’s equation (21.1): 


yh" 4-(2m]h2) Ey = 0. 
This equation has real solutions in the form of a stationary plane wave 
yi = a cos(kx+ ò), (21.5) 


where a and 5 are constants, and the wave number k = p/h = y(2mE)jh. 
This formula determines the asymptotic form (for x >+ œ) of the wave 
functions of the non-degenerate energy levels in the range (21.4) of the 
continuous spectrum. For large negative values of x, Schrédinger’s equation 
is 


Y” —(2m|h2)\(Uy— Ey = 0. 
The solution which does not become infinite as x > — œ is 
by = besz, where k = 4/[2m(U,—E)]/h. (21.6) 


This is the asymptotic form of the wave function as x > —oo. Thus the 
wave function decreases exponentially in the region where E < U. 
Finally, for 
E>U, (21.7) 


the spectrum will be continuous, and the motion will be infinite in both 
directions. In this part of the spectrum all the levels are doubly degenerate. 
This follows from the fact that the corresponding wave functions are deter- 
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mined by the second-order equation (21.1), and both of the two independent 
solutions of this equation satisfy the necessary conditions at infinity (whereas, 
for instance, in the previous case one of the solutions became infinite as 
x > —oo, and therefore had to be rejected). The asymptotic form of the 
wave function as x > +00 is 


b= atty ase tk, (21 8) 


and similarly for x + —oo. The term e'** corresponds to a particle moving 
to the right, and e-*** corresponds to one moving to the left. 

Let us suppose that the function U(x) is even [U(—x) = U(x)]. Then 
Schrédinger’s equation (21.1) is unchanged when the sign of the co- 
ordinate is reversed. It follows that, if (x) is some solution of this equation, 
then s(—<x) is also a solution, and coincides with U(x) apart from a constant 
factor: ¥(—x) = (x). Changing the sign of x again, we obtain y(x) = 
c*U(x), whence c = +1. Thus, for a potential energy which is symmetrical 
(relative to x = Q), the wave functions of the stationary states must be either 
even [y(— x) = ¥(x)] or odd [ġ— (x)= —v(x)].¢ In particular, the wave 
function of the ground state is even, since it cannot have a node, while an 
odd function always vanishes for x = 0 [¥(0) = —(0) = 0). 

To normalize the wave functions of one-dimensional motion (in a continu- 
ous spectrum), there is a simple method of determining the normalization 
coefficient directly from the asymptotic expression for the wave function for 
large values of |x]. 

Let us consider the wave function of a motion infinite ın one direction, 
x> + œ. The normalization integral diverges as x > œ (as x> — oo, the 
function decreases exponentially, so that the integral rapidly converges). 
Hence, to determine the normalization constant, we can replace y by its 
asymptotic value (for large positive x), and perform the integration, taking as 
the lower limit any finite value of x, say zero; this amounts to neglecting a 
finite quantity in comparison with an infinite one. We shall show that the 
wave function normalized by the condition 


| von dx = (FF | = 2nh 8(p~p’), (21.9) 


a 


where p is the momentum of the particle at infinity, must have the asymptotic 
formu(2 1.5) vith S22: 


Yp = 2 cos (kx +ê) = etka +o) 4 e-ilkz +8), (21.10) 
Since we do not intend to verify the orthogonality of the functions corre- 
y g J 


+ In this discussion it is assumed that the stationary state 1s not degenerate, i.e. the motion 
is not infinite in both directions. Otherwise, when the sign of x is changed, two wave functions 
belonging to the energy level concerned may be transformed into each other. In this case, 
however, although the wave functions of the stationary states need not be even or odd, they 
can always be made so (by choosing appropriate linear combinations of the origina] functions). 
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sponding to different p, on substituting the functions (21.10) in the normali- 
zation integral we shall suppose the momenta p and p’ to be arbitrarily 
close; we can therefore put 6 = 6’ (in general ô is a function of p). Next, we 
retain in the integrand only those terms which diverge for p = p’; in other 
words, we omit terms containing the factor et(*tk 2, Thus we obtain 


co Ls 9) 


O 
f derijisi f Aik’ Sey + | p-itk’ -bx dy = Í ei(k’ -kz dx, 
0 0 — %0 


which, from (15.7), is the same as (21.9). 
The change to normalization by the delta function of energy is effected, in 
accordance with (5.14), by multiplying yp by 


d(p/2nh)\2 I 
( dE ) ~ (rhu) 


where v is the velocity of the particle at infinity. Thus 


i 
ag Jarh” 


1 ; 
2 (et(kz +6) 4 @ ~U(kz +8)), (21.11) 
/(2rhv) 


The current density is 1/27h in each of the travelling waves that make up the 
stationary wave (21.11). Thus we can formulate the following rule for the 
normalization of the wave function for a motion infinite in one direction by 
the delta function of energy: having represented the asymptotic expression 
for the wave function in the form of a sum of two plane waves travelling in 
opposite directions, we must choose the normalization coefficient in such a 
way that the current density in the wave travelling towards (or away from) 
the origin is 1/27h. 

Similarly, we can obtain an analogous rule for normalizing the wave func- 
tions of a motion infinite in both directions. The wave function will be 
normalized by the delta function of energy if the sum of the probability cur- 
rents in the waves travelling towards the origin from x = +œ and x = =œ 
is 1/27h. 


§22. The potential well 

As a simple example of one-dimensional motion, let us consider motion in 
a square potential well, i.e. in a field where U(x) has the form shown in Fig. 1 
(p 64)000%@) = O0'forW = & <a, UG) Uo tors = Candy coe. Itis 
evident a priori that for E < Uo the spectrum will be discrete, while for 
E > Uo we have a continuous spectrum of doubly degenerate levels. 
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Pics 1 


In the region 0 < x < a we have Schrédinger’s equation 


ih" 4-(2mn!R) Ey = 0 (22.1) 


(the prime denotes differentiation with respect to x), while in the region 
outside the well 


pi" + (2m; h®)(E—U,)ys = 0 (22.2) 


For x = 0 and x = a the solutions of these equations must be continuous 
together with their derivatives, while for x = +œ the solution of equation 
(22.2) must remain finite (for the discrete spectrum when E < Up it must 
vanish). 

For E < Up», the solution of equation (22.2) which vanishes at infinity is 


yy = constant xe7*t, where x = 4/[(2m/h?)(U,—E)]; (22.3) 


the signs — and + in the exponent refer to the regions x > a and x <0 
respectively. The probability |4|? of finding the particle decreases exponen- 
tially in the region where E < U(x). Instead of the continuity of % and wp’ 
at the edge of the potential well, it 1s convenient to require the continuity of 
y and of its logarithmic derivative ¥’/y. Taking account of (22.3), we obtain 
the boundary condition in the form 


Wh = Fk. (22.4) 


We shall not pause here to determine the energy levels in a well of arbitrary 
depth U, (see Problem 2), and shall analyse fully only the limiting case of 
infinitely high walls (U, > œ). 

For U, = ©, the motion takes place only between the points x = 0 and 
x = a and, as was pointed out in §18, the boundary condition at these points 
IS 


b= 0. (22.5) 
(It is easy to see that this condition ts also obtained from the general condition 
(22.4). For, when U, > œ, we have also x > œ and hence y'i} > œ; 


since yY’ cannot become infinite, it follows that fy = 0.) We seek a solution 
of equation (22.1) inside the well in the form 


y = csin(kx+8), where k = «/(2mE/h?). (22.6) 
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The condition y = 0 for x = 0 gives ô = 0, and then the same condition for 
x = a gives sin ka = 0, whence ka = nz, n being a positive integer,t or 

E, = (?h?/2ma®)n?, n = 1,2,3,.... (22.7) 


This determines the energy levels of a particle in a potential well. The 
normalized wave functions of the stationary states are 


wy, = /(2/a) sin(mnx/a). (22.8) 


From these results we can immediately write down the energy levels for a 
particle in a rectangular ‘‘potential box”, i.e. for three-dimensional motion 
in a field whose potential energy U = Ofor0 <x <a,0<y<b0<2<c¢ 
and U = œ outside this region. In fact, these levels are given by the sums 
geha (= No” 





Ninang 


2m \ g? D0? 


2 
+=) (711, ng, g = 1, 2 Saeed (22.9) 
c 


and the corresponding wave functions by the products 


8 X TI i TMa . Ta 
— sin—x en sin——z. (22.10) 


Le eae = 
Taare abc a c 


It may be noted that the energy Eo of the ground state is, by (22.7) or 
(22.9), of the order of #2/ml2, where / is the linear dimension of the region 
in which the particle moves. This result is in accordance with the uncertainty 
relation; nen the uncertainty in the coordinate is ~, the uncertainty in 
the momentum, and therefore the order of magnitude of the momentum 
itself, is ~A/l. The corresponding energy is ~ (h/l)?/m. 


PROBLEMS 


PROBLEM 1. Determine the probability distribution for various values of the momentum 
for the normal state of a particle in an infinitely deep square potential well. 


SOLUTION. The coefficients a(p) in the expansion of the function y¥, (22.8) in terms of the 
eigenfunctions of the momentum are 


1. > o 
a(p) z | Yp*yi dx = = | si (=) -iip dy. 
ya a 
0 
Calculating the integral and squaring its modulus, we obtain the required probability distri- 
bution: 
4 7h3 
|a(p)|? ae = Be ac cos? PS dp. 
rh (e= 2h 
PROBLEM 2. Determine the energy levels for the potential well shown in Fig. 2 (p. 66). 
SoLUTION. The spectrum of energy values E < U,, which we shall consider, is discrete. 
In the region x < 0 the wave function is 


y = aet, where «x, = V[(2m/h*?)(U,—E)}, 


+ For n = 0 we should have 4 = O identically. 
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U(x) 


Uz 


U, 


a x 
Fic, 2 


while in the region x > a 
y = cet, where x, = V/[(2mih?)(U,—B)]. 
Inside the well (0 < x < a) we look for yŅ in the form 
4 = csin(kx+8), where k=(2mE/h?). 
The condition of the continuity of y’/) at the edges of the well gives the equations 
k cot ô = n = y/[(2m h?)U,—R?), k cot(ka48) = — x, = — 4 [(2 h*)U,—?], 
or 
sin ô = kh 4/(2mU,), sin(ka+8) = —kħ 4/(2mU,). 
Eliminating 6, we obtain the transcendental equation 


ka = nn—sin™[kh/y/(2mU,)]—sin™ [Rhy \(2mU,)] (1) 


(where n = 1, 2, 3, ..., and the values of the inverse sine are taken between 0 and $7), whose 
roots determine the energy levels E = k*h*/2m. For each n there is in general one root; 
the values of n number the levels in order of increasing energy. 

Since the argument of the inverse sine cannot exceed unity, it is clear that the values of Å 
can lie only in the range from 0 to +/(2mU,/h*). The left-hand side of equation (1) increases 
monotonically with k, and the right-hand side decreases monotonically. Fence it is neces- 
sary, for a root of equation (1) to exist, that for k = /(2mU,/h?) the right-hand side should 
be Jess than the Jeft-hand side. In particular, the inequality 


ay(2mU,):h > 4n—sin AUU) (2) 


which is obtained for n = 1, is the condition that at least one energy level exists in the well. 
We see that for given and unequal U,, U; there are always widths a of the well which are so 
small that there is no discrete energy level. For U, = U}, the condition (2) is evidently always 
satisfied. 

For U, = U, = U, (a symmetrical well), equation (1) reduces to 


sin [hk y/(2mU,)) = (nn — ka). (3) 
Introducing the variable ¢ = 4ka, we obtain for odd n the equation 
cosé = yf, where ~v = (A a)\(2)mU5), (4) 


and those roots of this equation must be taken for which tan £ > 0. For even n we obtain 
the equation 

sing = y$, (5) 
and we must take those roots for which tan £ < 0. The roots of these two equations deter- 


mine the energy levels E = 2£2h? ma?. The number of levels is finite when y # 0. 

In particular, for a shallow well in which Uo < h2/ma?, we have y ® 1 and equation (5) 
has no root. Equation (4) has one root (with the upper sign on the right-hand side), 
€ = 1/y—1/2y5. Thus the well contains only one energy level, 

Eo = Up—(ma?/2h2)U 2, 
which is near the top of the well. 
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PROBLEM 3. Determine the pressure exerted on the walls of a rectangular “‘potential 
box’’ by a particle inside it. 


SoLUTION. ‘The force on the the wall perpendicular to the x-axis is the mean value of the 
derivative —2H/é@a of the Hamilton’s function of the particle with respect to the length of 
the box in the direction of the x-axis. The pressure is obtained by dividing this force by the 
area be of the wall. According to the formula (11.16), the required mean value is found by 
differentiating the eigenvalue (22.9) of the energy. The result is 


pl) = r2h2n,?/marsoc. 


§23. The linear oscillator 


Let us consider a particle executing small oscillations in one dimension 
(what is called a Linear oscillator). The potential energy of such a particle 
is 4mw2x?, where w is, in classical mechanics, the characteristic (angular) 
frequency of the oscillations. Accordingly, the Hamiltonian of the oscillator 
is 


AY =4 f? jmp imo? (23.1) 


Since the potential energy becomes infinite for x = + œ, the particle can 
have only a finite motion, and the energy eigenvalue spectrum is entirely 
discrete. 

Let us determine the energy levels of the oscillator, using the matrix 
method+. We shall start from the equations of motion in the form (19.3); 
in this case they give 


+w? = 0. (23.2) 
In matrix form, this equation reads 


(%) min fw my = 0. 


For the matrix elements of the acceleration we have, according to (11.8), 
(%)mn E imn )nn = —Wmn ee Hence we obtain 


(Omn — w?) mn = 0. 


Hence it is evident that all the matrix elements xmn vanish except those for 
which wmn = w or wmn = —w. We number all the stationary states so that 
the frequencies + w correspond to transitions n >n F 1, Le. @nnz1 = tw. 
Then the only non-zero matrix elements are xn,7_1. 

We shall suppose that the wave functions p, are taken real. Since x is a real 
quantity, all the matrix elements xmn are real. The Hermitian condition 
(11.10) now shows that the matrix xmn is symmetrical: 


Xmn = *nm: 


To calculate the matrix elements of the coordinate which are different 





+ This was done by Heisenberg in 1925, before Schrédinger’s discovery of the wave 
equation. 
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from zero, we use the commutation rule 


KK—KE = —ih/m, 
written in the matrix form 
(XX) mn—(*%) inn = —(th/M)E mn- 


By the matrix multiplication rule (11.12) we hence have for m =n 
i È (wniXniXin— XnlwinXin) = 21 = wnlXn? = —th/m. 


In this sum, only the terms with / = n + 1 are different from zero, so that 


we have 
(xn tln)? —(Xn,n ea)? = h/2mw. (23.3) 


From this equation we deduce that the quantities (xn,1)? form an arith- 
metic progression, which is unbounded above, but 1s certainly bounded 
below, since it can contain only positive terms. ‘Since we have as yet fixed 
only the relative positions of the numbers n of the states, but not their abso- 
lute values, we can arbitrarily choose the value of corresponding to the first 
(normal) state of the oscillator, and put this value equal to zero. Accordingly 
xo _, must be regarded as being zero identically, and the application of equa- 
tions (23.3) with n = 0, 1, ... successively leads to the result 


(Xp ny)? = nh[2mw. 


Thus we finally obtain the following expression for the matrix elements of 
the coordinate which are different from zero:t 


Xan = Xni, = \/(nh/2mw). (23.4) 


The matrix of the operator Ĥ is diagonal, and the matrix elements H,,,, 
are the required eigenvalues E, of the energy of the oscillator. To calculate 
them, we write 


Han = En = m(n tH w Jan] 
= dl È iwn nto nto” E Xna] 
=m E (wH wn). 
In the sum over 1, only the terms with / = n+1 are different from zero; 
substituting (23.4), we obtain 


Paeria 2 = 0,1, 2pcar (23.5) 


Thus the energy levels of the oscillator lie at equal intervals of Aw from 
one another. The energy of the normal state (n = 0) is 44w; we call atten- 
tion to the fact that it is not zero. 


+t We choose the indeterminate phases an (see the second footnote to §11) so as to obtain the 
plus sign in front of the radical in all the matrix elements (23.4). Such a choice ts always 
possible for a matrix in which only those elements are different from zero which correspond 
to transitions between states with adjacent numbers 
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The result (23.5) can also be obtained by solving Schrédinger’s equation. 
For an oscillator, this has the form 


dys 2m. nee T 
Tar —dmuw*x*)b = 0. (23.6) 


Here it is convenient to introduce, instead of the coordinate x, the dimension- 
less variable ¢ by the relation 


& = 4/(mw/h)x. (23.7) 
Then we have the equation 
p+ [(2E|fiw) £2} = 0; (23.8) 


here the prime denotes differentiation with respect to £. 

For large €, we can neglect 2E/fAw in comparison with £7; the equation 
W” = ab has the asymptotic integrals = e+#"* (for differentiation of this 
function gives p” = 4 on neglecting terms of order less than that of the 
term retained). Since the wave function % must remain finite as € >+0, 
the index must be taken with the minus sign. It is therefore natural to make 
in equation (23.8) the substitution 


yp = et"? y(E). (23.9) 


For the function x(£) we obtain the equation (with the notation (2E//iw)—' = 
2n; since we already know that E > 0, we have n > — $) 


x —2&x’+2nx = 0, (23.10) 


where the function x must be finite for all finite £, and for -+œ must not 
tend to infinity more rapidly than every finite power of € (in order that the 
function % should tend to zero). 

Such solutions of equation (23.10) exist only for positive integral (and 
zero) values of n (see §a of the Mathematical Appendices); this gives the 
eigenvalues (23.5) for the energy, which we know already. The solutions of 
equation (23.10) corresponding to various integral values of n are x = con- 
stant x H,,(£), where H,,(€) are what are called Hermite polynomials; these 
are polynomials of the nth degree in ¢, defined by the formula 


H,(&) = (—1)"ef d(e’) dé". (23.11) 


Determining the constants so that the functions pn satisfy the normalization 
condition 


f Øn(x) dx = 1, 
we obtain (see (a.7)) 


Mu 1/4 1 
Pals) = (=) maa nev imesh). (23.12) 
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Thus the wave function of the normal state is 
Plx) = (monfrh)'/4e—morri2n, (23.13) 
It has no zeros for finite x, which is as it should be. 
By calculating the integrals l Wrttmé dé, we can determine the matrix ele- 


ments of the coordinate; this calculation leads, of course, to the same values 
(23.4). 

Finally, we shall show how the wave functions ¢,, may be calculated by the 
matrix method. We notice that, in the matrices of the operators £ +iwĝ, 
the only elements different from zero are 


(ž—iwx)nin = —(F 410%) ai = —ix/(2whn/m). (23.14) 


Using the general formula (11.11), and taking into account the fact that 
w—-1 = 0, we conclude that 


(X-—iurx)yYig = 0. 
After substituting the expression = —i(f/m)d/dx, we obtain the equation 
dpojdx = —(mæjh)xpo, 
whose normalized solution is (23.13). And, since 
(Biok) na = (#Hiws)n natn = iV(2ohn]m) bn, 
we obtain the recurrence formula 


Yn = v(m|2wħn)[— (h/m) djdx + wx) bn 


E : F 2 = = : £2/2 sa £2 x 
e (2n) ( d a £) Pn- A \/(2n) e m 12 Pini) . 


when this is applied z times to the function (23.13), we obtain the expression 
(23.12) for the normalized functions yn. 








PROBLEMS 
PROBLEM 1. Determine the probability distribution of the various values of the momentum 
for an oscillator. 


SOLUTION. Instead of expanding the wave function of the stationary state in terms of the 
eigenfunctions of momentum, it is simpler in the case of the oscillator to start directly from 
Schrédinger’s equation in the momentum representation. Substituting in (23.1) the coordi- 
nate operator $ = thd/dp (15.12), we obtain the Hamiltonian in the p representation, 


H = 4p2im—4mu*h® d?/dp?. 


The corresponding Schrédinger's equation Ha(p) = Ea(p) for the wave function a(p) in the 
momentum representation is 


d? 2 2 
dp? muh? 2m 


This equation is of exactly the same form as (23.6); hence its solutions can be written down 
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at once by analogy with (23.12). Thus we find the required probability distribution to be 


2 dp a ee -PMO 
lanl)? 35 = znry. e Hp] V ima) dp. 


PROBLEM 2. Determine the lower limit of the possible values of the energy of an oscillator, 
using the uncertainty relation (16.7). 
SOLUTION. Since x? = x?+(dx)?, p? = p?+(8p)*, (16.7) gives for the mean value of the 
energy of the oscillator 
E = Smw2x? +! p2/m > imwa? (òx)? + (dp)? im 
> mu2h?/8(p)? + (8p)?/2m. 


On determining the minimum value of this expression (regarded as a function of 5p), we 
find the lower limit of the mean values of the energy, and therefore that of all possible values: 


E > thw. 


PROBLEM 3. Find the wave functions of the states of a linear oscillator that minimize the 
uncertainty relation, i.e. in which the standard deviations of the coordinate and momentum 
in the wave packet are related by 5p ôx = $A (E. Schrödinger 1926).t 


“SOLUTION. The required wave functions must have the form 


1 ip: x—x)2 ; 
(amaan P i 3 m j ia} 


Their dependence, on the coordinate at any instant is in accordance with (16.8), x = x(t) 
and p = p(t) = mx(t) being the mean values of the coordinate and the momentum; according 
to (19.3), we have, for a linear oscillator (U = 4mw?x?), p= —mw?x, and therefore for the 
mean values p = —mw?®x or 


(x, t) = 


x+ w?x = 0, (2) 


i.e. the function x(z) satishes the classical equation of motion. The constant factor in (1) is 
determined by the normalization condition 


am 
[meas =i; 


in addition to this factor, Y may contain a phase factor with a time-dependent phase ¢(). 
The unknown constant 6x and the unknown function ¢(¢) are found by substituting (1) in 
the wave equation 
2 aap 
Se der ee 
2m èx? ` ĉt 





With (2), the substitution gives 


(he ae aa) + ES ~~ eee + Atd = zan | Pee 
: h? 4(8x)3 2h? G(Sx)) Sx)? 5 
Hence (ôx)? = A/2mw and 

$ = TF w5?) + be, 


$= abt + hut. 


Thus we have finally 


Tt) = (=)" exp E - a} exp i- Stee -5h (3) 
m 2 


When x = 0 and p = 0, this becomes yy(x)e™2t?, the wave function of the oscillator ground 
state. 


+ These are called coherent states. 
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72 
The mean energy of the oscillator in a coherent state is 
= r? poe 
E= 2.2 ate 
2mi ~ 
(4) 


= 
P+ lmu2k? + hw = hofi + 4); 


2m 
the quantity # is the mean “number” of quanta Aw in the state. We see that the coherent 
state is completely specified by the function x(t) satisfying the classical equation (2). The 
general form of this. function may be given as 
ZONGE egi jat =R (5) 
vV (2mhw) 


The function (3) can be expanded in wave functions of the stationary states of the oscillator: 


œ 


T = È an¥a 


n=O 


¥1(x, t) = dn(x) exp {-—1(n + yot}. 


The coefficients in this expansion are (cf. §41, Problem 1) 


an = [yaey dx. (6) 


The probability for the oscillator to be in the nth state is therefore 


Wy = |a,\* = e-annin!, (7) 
the Poisson distribution. 
U(x) 
x 
-Å 
Fic. 3 


PROBLEM 4. Determine the energy levels for a particle moving in a field of potential energy 


(Fig. 3) 
U(x) = A(e2t=—2e-*2) 
(P. M. Morse). 
SoLUTION. The spectrum of positive eigenvalues of the energy is continuous (and the levels 
are not degenerate), while the spectrum of negative eigenvalues is discrete. 
Schrédinger’s equation reads 
Np /dx®-+ (2m/hi2)(E— Ae*2-4.2Ae~*2)y = O. 


We introduce a new variable 
24/(2mA) 
= et 


ah 
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(taking values from 0 to œ) and the notation (we consider the discrete spectrum, so that 
E < 0) 
= V(—2mE)/oh, n = +/(2mA)/oh—(s+}), (1) 


Schrédinger’s equation then takes the form 


i sê 
a ġ = 0. 





+y +(- Fy 


As £ —> œ, the function 4 behaves asymptotically as e+łé, while as £ — O it is proportional to 
¿+s, From considerations of finiteness we must choose the solution which behaves as e7} 
as €—> © and as £5 as —> 0. We make the substitution 


y = eE fwg) 
arid obtain for w the equation 


Ew +(2s+1—£)u'+nw =0, (2) 


which has to be solved with the conditions that w is finite as Ẹ —> 0, while as £ > œ, w tends 
to infinity not more rapidly than every finite power of €. Equation (2) is the equation for a 
confluent hypergeometric function (see §d of the Mathematical Appendices): 


w == F(—n, 254, £), 


A solution satisfying the required conditions is obtained for non-negative integral n (when 
the function F reduces to a polynomial). According to the definitions (1), we thus obtain 
for the energy levels the values 


ah 


ae ali mAN 


mtn], 


where 7 takes positive integral values from zero to the greatest value for which \/(2mA)/2h > 
n-+-4 (so that the parameter s is positive in accordance with its definition). Thus the discrete 
spectrum contains only a limited number of levels. If ./(2mA)/ah < 4, there is no discrete 
spectrum at all. 


U(x) 
x 
“Uo 
Fic. 4 
PROBLEM 5. The same as Problem 4, but with U = —U,/cosh? ax (Fig. 4). 


SOLUTION. ‘The spectrum of positive eigenvalues of the energy is continuous, while that of 
negative values is discrete; we shall consider the latter. Schrédinger’s equation is 





dy 2m = (E+ 


= 0 
dx dx ry 
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We put £ = tanh ax and use the notation 


€ = 1/(—2mE)/Az, 2mUp/a2h? = s(s+1), 


-i(-1+/['+ Fe) 


obtaining 





spoek [ooo y= 


This is the equation of the associated Legendre polynomials; it can be brought to hyper- 
geometric form by making the substitution ù = (1—£?)*/2 w(é) and temporarily changing 
the variable tou = 3(1—§): 
u(1 —u)w” +(e + 1)(1 — 2u)w" — (e —s)(e+ s+ 1) = 0. 
The solution finite for £ = 1 (i.e. for x = ©) is 
y = (1—£)¢/2Fle—s, e+54+1, +1, (1 —£)]. 
If % remains finite for ë = —1 (i.e. for x = —©), we must have e—s = —n, where 


n=0,1,2,...; then F is a polynomial of degree n, which is finite for é = —1. 
Thus the energy levels are determined by s — «=n, or 


E= h?g? 142 1 EmU, 0 2 
a a a 


There is a finite number of levels, determined by the condition « > 0, ie. n <s. 








§24. Motion in a homogeneous field 


Let us consider the motion of a particle in a homogeneous external field. 
We take the direction of the field as the axis of x; let F be the force acting 
on the particle in this field. In an electric field of intensity Æ, this force is 
F = eE, where e is the charge on the particle. 

The potential energy of the particle in the homogeneous field is of the 


form U = —Fx-+constant; choosing the constant so that U = 0 for x = 0, 
we have U = — Fx. Schrédinger’s equation for this problem is 
dys /dx2+-(2m/h*)(E+- Fx) = 0. (24.1) 


Since U tends to + coasx + — œ, and vice versa, it is clear that the energy 
levels form a continuous spectrum occupying the whole range of energy 
values E from —œ to +00. None of these eigenvalues is degenerate, and 
they correspond to motion which is finite towards x = — oo and infinite to- 
wards x = +0. 

Instead of the coordinate x, we introduce the dimensionless variable 


E = (x+ E/F)(2mF/ R). (24.2) 
Equation (24.1) then takes the form 
p+ = 0. (24.3) 
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This equation does not contain the energy parameter. Hence, if we obtain a 
solution of it which satisfies the necessary conditions of finiteness, we at 
once have the eigenfunction for arbitrary values of the energy. 

The solution of equation (24.3) which is finite for all x has the form (see 
$b of the Mathematical Appendices) 


¥(f) = AD(—64), (24.4) 


where 
1 ce e] 
Mf) = = i cos(4u3-+-ué) du 
V T 9 


is called the Azry function, while A is a normalization factor which we shall 
determine below. 

As € > — œ, the function $(€) tends exponentially to zero. The asymp- 
totic expression which determines (£) for large negative values of £ is (see 
(b.4)) 


A 
PE) z Jema ezpl ~se e]: (24.5) 


For large positive values of é, the asymptotic expression for ¥(£) is (see 


(b.5))T 
P(E) = A-1 sin(gé8/2-440). (24.6) 


Using the general rule (5.4) for the normalization of eigenfunctions of a 
continuous spectrum, let us reduce the function (24.4) to the form normalized 
by the delta function of energy, for which 


[WERE dx = 8(E'—E). (24.7) 


In §21 we gave a simple method of determining the normalization coefficient 
by means of the asymptotic expression for the wave functions. Following 
this method, we represent the function (24.6) as the sum of two travelling 
waves: 


WE) = RAE? exp(z[30°?—30]) +3 AE-14 exp(—i[388?— 47). 
The current density, calculated from each of these two terms, is 
v(A/2€1/4)2 = 4/[2(E + Fx)/m](A/2€1/4)2 = A2(2AF)/3/4m2/3, 


and equating this to 1/27h we find 


(2m)1/3 
A = ——. (24.8) 
gl /2F1 /6f2/3 


t It may be noted, by way of anticipation, that the asymptotic expressions (24.5) and (24.6) 
correspond to the quasi-classical expressions for the wave function in the classically inacces- 
sible and accessible regions (§47). 
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PROBLEM 
Determine the wave functions in the momentum representation for a particle in a homo- 
geneous field. 
SoL_uTION. The Hamiltonian in the momentum representation is 


IT = p 2m-ihF d dp, 


so that Schrédinger’s equation for the wave function a( p) has the form 
da /p? 
—1hF—+ (2-2) = 0 
dp \2m 


Solving this equation, we find the required functions 


agp) = (22hF)- lee IMFXEp-p cm) 


These functions are normahzed by the condition 


| 2e"(p)ae(p) dp = &(E'—E). 


§25. The transmission coefficient 


Let us consider the motion of particles in a field of the type shown in 
Fig. 5: U(x) increases monotonically from one constant limit (U = 0 as 
x > — œ) to another (U = U, as x + +œ). According to classical mech- 
anics, a particle of energy E < U, moving in such a field from left to right, 
on reaching such a “‘potential wall”, is reflected from it, and begins to move 
in the opposite direction; if, however, E > Up, the particle continues to 
move in its original direction, though with diminished velocity. In quantum 
mechanics, a new phenomenon appears: even for E > Up, the particle may 
be reflected from the potential wall. The probability of reflection must in 
principle be calculated as follows. 


ut) 





Fic. 5 


Let the particle be moving from left to right. For large positive values of 
x, the wave function must describe a particle which has passed ‘‘above the 
wall” and is moving in the positive direction of x, i.e. it must have the asymp- 
totic form 


for x > ©, x Aek, where k, = (1/h)\/[2m(E—U,)] (25.1) 


and A is a constant. To find the solution of Schrédinger’s equation which 
satisfies this boundary condition, we calculate the asymptotic expression for 


§25 The transmission coefficient 77 


x > — ©; itis a linear combination of the two solutions of the equation of 
free motion, i.e. it has the form 


for x > — œ, ù x ktg Beth, where k, = /(2mE)/h. (25.2) 


The first term corresponds to a particle incident on the wall (we suppose 
ys normalized so that the coefficient of this term is unity); the second term 
represents a particle reflected from the wall. The probability current 
density in the incident wave is k,, in the reflected wave k,|B|?, and in the 
transmitted wave k,|A|?. We define the transmission coefficient D of the par- 
ticle as the ratio of the probability current density in the transmitted wave 
to that in the incident wave: 


D = (halhy) Al? (25.3) 


Similarly we can define the reflection coefficient R as the ratio of the density 
in the reflected wave to that in the incident wave. Evidently R = 1—D: 


R = |B|? = 1—(hy/k,) Al? (25.4) 


(this relation between A and B is automatically satisfied). 

If the particle moves from left to right with energy E < Up, then k, is 
purely imaginary, and the wave function decreases exponentially as x > +o. 
The reflected current is equal to the incident one, i.e. we have “‘total reflec- 
tion” of the particle from the potential wall. We emphasize, however, that 
in this case the probability of finding the particle in the region where E < U 
is still different from zero, though it diminishes rapidly as x increases. 

In the general case of an arbitrary stationary state (with energy E > Uo), 
the asymptotic form of the wave function is given, both for x —— œ and for 
x > + œ, by a sum of waves propagated in each direction: 


= Ayetkiz+ Bye tkiz for x > —, 
pn as i \ (25.5) 


y = Agek:t 4 Boe tksz for x> +0. 


Since these expressions are asymptotic forms of the same solution of a linear 
differential equation, there must be a linear relation between the coefficients 
Aj, Bı and Ae, Bz. Let Ag = «41+ Bı, where «, $ are constants (in general 
complex) which depend on the specific form of the field U(x). The corres- 
ponding relation for Bz can then be written down from the fact that Schré- 
dinger’s equation is real. This shows that, if % is a solution of a given 
Schrédinger’s equation, the complex conjugate function ¥* is also a solution. 
The asymptotic forms 


ye = Ay*e tit 4 By xetkiz for x—>—, 
ys = Ate thet 4 Poretksz for x> +o 


differ from (25.5) only in the nomenclature of the constant coefficients; we 
therefore have B2* = «B,* + 8B4* or Bo = «*B, + 8*4. Thus the coefficients 
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in (25.5) are related by equations of the form 
Ag = aåı+ßBı, Bo = f*Ai + a*B). (25.6) 
The condition of constant current along the x-axis leads to the relation 
ky(|Ai|?—|Bal?) = k| 42|2— |B2]?). 
Expressing Az, Be in terms of Ay, Bı by (25.6), we find 
|x|?— |B]? = kı/ko. (25.7) 


Using the relation (25.6), we can show, in particular, that the reflection 
coefficients are equal (for a given energy E > Up) for particles moving in the 
positive and negative directions of the x-axis; the former case corresponds to 
putting Be = 0 in (25.5), and the latter case to A; = 0. In these two cases, 
By/A, = —f*/oe* and Ac/Be = B/a* respectively. The corresponding re- 
flection coefficients are 


Re= BUA 
Ry = |Ao/Bo|? 


whence it is clear that Ry = Ro. 


It is natural to call B,/A,; = —B*/a* and A,/B, = B/a* the reflection 
amplitudes for motion in the positive and negative directions respectively. 
They are equal in modulus but may have different phase factors. 


paas, 
[B/a*l?, 


PROBLEMS 


PROBLEM 1. Determine the reflection coefficient of a particle from a rectangular potential 
wall (Fig. 6); the energy of the particle E> U» 


u(x) 





Fic. 6 


SoLUTION. Throughout the region x > 0, the wave function has the form (25.1), while in 
the region x < 0 its form is (25.2). The constants A and B are determined from the condi- 
tion that ù and dġ/dx are continuous at x = 0: 


1+B =A, k(l1—B) = k,A, 
A = 2k,/(ki +h), B = (kı—k)/(ki+ ki). 


whence 
The reflection coefficient} is (25.4) 
R= e9) = (2y 
k, +k, PitPs 


For E = U, (k; = 0), R becomes unity, while for E — © it tends to zero as (U,/4E)*. 





+ In the limiting case of classical mechanics, the reflection coefficient must become zero. 
The expression obtained here, however, does not contain the quantum constant at all. This 
apparent contradiction is explained as follows. The classical limiting case is that in which 


§25 ` The transmission coefficient 79 


PROBLEM 2. Determine the transmission coefficient for a rectangular potential barrier(Fig. 7). 
U(x) 


Uo 


a 
Fic. 7 


SOLUTION. Let E be greater than Uo, and suppose that the incident particle is moving from 
left to right. Then we have for the wave function in the different regions expressions of the 
form 


for x < 0, Y = ek 24 Ae-iRT, 
for 0 <x < a, wy = Bet*,<4 B’e-**,=, 
for x > a, y = Celik = 


(on the side x > a there can be only the transmitted wave, propagated in the positive direc- 
tion of x). The constants A, B, B’ and C are determined from the conditions of continuity 
of % and d¢/dx at the points x = 0 and a. The transmission coefficient is determined as 
D = k,|C|?/Rk, = |C|?. On calculating this, we obtain 
Pan 4k Ek 
(k?—k,?)? sinuka + tk, k? 


For E < Uo, k, is a purely imaginary quantity; the corresponding expression for D is 
obtained by replacing k, by ixa, where fix, = +/[2m(U,—E)]: 
4k,*K0* 


O (k+ Kè)? sinh?ars thr 


PROBLEM 3. Determine the refiection coefficient for a potential wall defined by the formula 
U(x) = U,/(1 +677) (Fig. 5); the energy of the particle is E > Uù». 


SOLUTION. SchrGdinger’s equation is 





dy 2m U, | 
ree fa ) zi 
dx? R\N l4enat 
We have to find a solution which, as x — +, has the form 
yy = constant x e*.= 
We introduce a new variable 
ra = =g IT 
(which takes values from —-œ to 0), and seek a solution of the form 


y = E-** lou €), 


the de Broglie wavelength of the particle A~/i/p is small in comparison with the characteristic 
dimensions of the problem, i.e. the distances over which the field U(x) changes noticeably. 
In the schematic example considered, however, this distance is zero (at the point x = 0), so 
that the passage to the limit cannot be effected. 
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where w() tends to a constant as $ > 0 (e. as x > œ). For w(Ẹ) we find an equation of 
hypergeometric type: 


El — ëw" + (1 —2ik, 2) (1 — Ew t (kt — k ye x= = 0, 
which has as its solution the hypergeometric function 
w = Flifh,—hal/o,— iy + a)/o, —2ikala +1, E) 


(we omit a constant factor). As £ — 0, this function tends to 1, i.e. it satisfies the condition 


imposed. 
The asymptotic form of the function ¢ as -> — o (i.e. x > — œ) ist 


"EN g-tk, C,(— gyikk) la} C {— £)i,+k,) t] =(— 1)-* fel Cetk.t + C,e~s7), 


where 
P(—2ik, Ja) P(—2thefat 1) 
= Tm i(k pha) — i Ry + Radiat 1) 
T(2ik, ja) (—2ikaja+ 1) 
2 = TR —Aa) lilk — ka) a+ 1) 


Cı 


The required refiection coefficient is R = iC./C,|2; on calculating it by means of the well 
known formula 


[(x)P(i—x) = a/sin nx, 


we have 
sinh[a(k,— )ja] \? 


x Erm i 


For E = U, (k; = 0), R becomes unity, while for E> œ it tends to zero as 


2 
¢ Ue ) iar VimE) jon, 
oh / E 





In the limiting case of classical mechanics, R becomes zero, as it should. 


PROBLEM 4. Determine the transmission coefficient for a potential barrier defined by the 
formula 
U(x) = U,/cosh®ax 


(Fig. 8); the energy of the particle is E < Uo. 


u(x) 


Fic. 8 


TS 


+ See formula (e.6), in each of whose two terms we must take only the first term of the 
expansion, i.e. replace the hypergeometric functions of 1/z by unity. 
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SOLUTION. The Schrédinger’s equation is the same as that obtatned in the solution of 
Problem 5, §23; it is necessary merely to alter the sign of U, and to regard the energy E 
now as positive. A similar calculation gives the solution 

y= (1—€%)-#/2e Fl —tk/a—s, —ikja+s+1, —tk/o+1,3(1-£)}, 


where 
€ = tanh ax, k= \/(2mE)/h, 


(= fH) 


This solution satisfies the condition that, as x —> œ (i.e. as E —> 1, (1—&) = 2e-24*), the wave 
function should include only the transmitted wave (~e'**). The asymptotic form of the 
wave function as x > — œ© ( > —1) is found by transforming the hypergeometric function 
with the aid of formula (e.7): 





Ck:a) ~ tk: 2) re [{ —rkja)C(1 — iki) 


P(—s)P(l +s) V( -ikja—s)T(-1tk/a+s541) (2) 


w ~~ e -ikr 


Taking the squared modulus of the ratio of coefficients in this function, we obtain the follow- 
ing expression for the transmission coefficient D =1 — R: 


D sinh*( kj) 
~ sinh®(wk/a:)-+cos®{ har v'(1 —8mU,/hi232)] 


(if 8mUo/h2a2<1), or 
sinh*{ kya) 
~~ sinh*(hja)-+ cosh"(4ir v(8mUyihta®— 1)] 


(if 8mUo/A?a? >1). The first of these formulae holds also for the case Uo < 0, i.e. when 
the particle is passing over a potential well instead of a potential barrier. It is interesting to 
note that in that case D = 1 if 1+8m!Uo|/h2a2 = (2n+1)?: thus, for certain values of the 
depth | Uo] of the well, particles passing over it are not reflected. This is evident from equation 
(2), where the term in e~'*7 vanishes for positive integral s. 


PROBLEM 5. Determine how the transmission coefficient tends to zero as E — 0, assuming that 
the potential energy Ulx! decreases rapidly at disiances |x| >a. where a is the dimension of the 
interaction region. 

SoLUTION. For distances k|x| <1, E can be neglected in Schrédinger’s equation. If also |x| > a, 
the potential energy can also be neglected. and the equation becomes — (h?/2m) dh /dx? = 0; the 
solution of this mav be written as 


w=a,+b,x forx <0, yw =a,+6,x forx >0. l 


‘The relation between a,, 6, and a,, b, can be found by solving the equation at distances |x| ~ a. 
li is linear: 


a, = Pd, + pb), b, = va, + Th3. (2 

The coefficients p, 4, v and t are real and independent of the energy, which does not appear in the 

equation.* The solution [1] must be the same as the first two terms in the expansion of (25.1); and 
25.2) in powers of x. so that 

Substituting these in (2) and solving for A, we get, for small k, A = 2ik |v, whence D = 4k?/v? ~ E. 


The transmission coefficient thus tends to zero in proportion to the particle energy. This is of course 
true for the examples in Problems 2 and 4. 


t Since the flux is constant, pt — yar = l. 


CHAPTER IV 


ANGULAR MOMENTUM 


§26. Angular momentum 

In §15, to derive the law of conservation of momentum, we have made use 
of the homogeneity of space relative to a closed system of particles. Besides 
its homogeneity, space has also the property of isotropy: all directions in it 
are equivalent. Hence the Hamiltonian of a closed system cannot change 
when the system rotates as a whole through an arbitrary angle about an 
arbitrary axis. It is sufficient to require the fulfilment of this condition for an 
infinitely small rotation. 

Let dep be the vector of an infinitely small rotation, equal in magnitude 
to the angle 5¢ of the rotation and directed along the axis about which the 
rotation takes place. The changes ôr, (in the radius vectors r, of the par- 
ticles) in such a rotation are 


ôr, = d¢p X ra. 
An arbitrary function (rj, ra ... ) is thereby transformed into the function 


¥(r, +6r,,ro+dre,...) = pTi Tiz -.. )+% ér,- Vall 
= YF}, To... )+ * 50 X rg. Vay 
= (1+6e. Èr, x VY alr E )- 


The expression 
1+dep. 2 X Va 


is the operator of an infinitely smali rotation. The fact that an infinitely smali 
rotation does not alter the Hamiltonian of the system is expressed (cf. §15) by 
the commutability of the ‘rotation operator” with the operator H. Since 
5@ is a constant vector, this condition reduces to the relation 


(E rax VJH-H(Er, x Va) = 0, (26.1) 


which expresses a certain law of conservation. 

The quantity whose conservation for a closed system follows from the 
property of isotropy of space is the angular momentum of the system (cf. 
Mechanics, §9). Thus the operator È ra x Va must correspond exactly, 
apart from a constant factor, to the total angular momentum of the system, 
and each of the terms ra x Va of this sum corresponds to the angular momen- 
tum of an individual particle. 


82 
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The coefficient of proportionality must be put equal to —17k; then the 
expression for the angular momentum operator of a particle is — tår x V = 
r xp and corresponds exactly to the classical expression r xp. Hencefor- 
ward we shall always use the angular momentum measured in units of A. The 
angular momentum operator of a particle, so defined, will be denoted by Í, 
and that of the whole system by L. Thus the angular momentum operator 
of a particle is 


hÎ =rxp = —ihrxV, (26.2) 


or, in components, 
Al, =yp,—2zp,, hl, = xp,—xp,, fil, = xfy— yf 


For a system which is in an external field, the angular momentum is in 
general not conserved. However, it may still be conserved if the field has a 
Certain symmetry. Thus, if the system is in a centrally symmetric field, all 
directions in space at the centre are equivalent, and hence the angular momen- 
tum about this centre will be conserved. Similarly, in an axially symmetric 
field, the component of angular momentum along the axis of symmetry is 
conserved. All these conservation laws holding in classical mechanics are 
valid in quantum mechanics also. 

In a system where angular momentum is not conserved, it does not have 
definite values in the stationary states. In such cases the mean value of the 
angular momentum in a given stationary state is sometimes of interest. It is 
easily seen that, in any non-degenerate stationary state, the mean value of the 
angular momentum is zero. For, when the sign of the time is changed, the 
energy does not alter, and, since only one stationary state corresponds to a 
given energy level, it follows that when ¢ is changed into —t the state of the 
system must remain the same. This means that the mean values of all 
quantities, and in particular that of the angular momentum, must remain 
unchanged. But when the sign of the time is changed, so is that of the angular 
momentum, and we have L = —L, whence it follows that L = 0. The same 
result can be obtained by starting from the mathematical definition of the 
mean value L as being the integral of »*Lis. The wave functions of non- 
degenerate states are real (see the end of §18). Hence the expression 


L = -th Í p*(Era x Va)ysdg 


is purely imaginary, and since L must, of course, be real, it is evident that 


Let us derive the rules for commutation of the angular momentum operators 
with those of coordinates and linear momenta. By means of the relations 


(16.2) we easily find 
(L.,x} =0, {Ly} =iz, {1,2} = iy, 
fy} =0, {1,2} =ix, {lnx} = —iz, + (26.3) 
{f,,2} = 0, {l x} 7 Ly, {Ly} = — íx. ) 
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For instance, 


1y—yl. = (1/h)(yp.—2b.)9 —WOP2— 2h )(1/h) 
= —(2/h\{pyy} = iz. 
All the relations (26.3) can be written in tensor form as follows: 
{Ín Xr} = te p%1, (26.4) 


where e,,, 1s the antisymmetric unit tensor of rank three,t and summation 1s 
implied over those suffixes which appear twice (called dummy suffixes). 

It is easily seen that a similar commutation rule holds for the angular 
momentum and linear momentum operators: 


1 Px} = iei Pr- (26.5) 


By means of these formulae, it is easy to find the rules for commutation of 
the operators /,, Í an f, with one another. We have 


ddy —tda) e L(zp,—xp.)—(2h.—xP le 
Mass p-—*(L.6.—p.l,) 
= —iyp,tixp, = ihl,. 


Thus 
fnb = iln {h b3 =i, {anb =i, (26.6) 


or 
il, i9 Tr TN (26.7) 


Exactly the same relations hold for the operators Ê,, L,, L, of the total 
angular momentum of the system. For, since the angular momentum oper- 
ators of different individual particles commute, we have, for instance, 


Loy Elas ~ Thos Day = Z(lovlar—loslay) = 22 bases 
Thus 
{i is = ie a Da = ae fli, ey = lhe (26.8) 


The relations (26.8) show that the three components of the angular momen- 
tum cannot simultaneously have definite values (except in the case where all 
three components simultaneously vanish: see below). In this respect the 
angular momentum is fundamentally different from the linear momentum, 
whose three components are simultaneously measurable. 


+ The antisymmetric umt tensor of rank three, eix: (also called the unit axial tensor), is 
defined as a tensor antisymmetric in all three suffixes. with eiz3 = 1. It is evident that, of 
its 27 components, only 6 are not zero, namely those in which the suffixes 7, k, / form some 
permutation of 1, 2, 3. Such a component is +1 if the permutation 7, k, l is obtained from 
1, 2, 3 by an even number of transpositions of pairs of figures, and is —1 if the number of 
transpositions is odd. Clearly eikleikm = 2ôim, eixieixt = 6. The components of the vector 
C = A xB which is the vector product of the two vectors A and B can be written by means 
of the tensor ex: in the form 


C; = e%1A,B. 


§26 Angular momentum 85 


From the operators Êz, Ly, Ê: we can form the operator of the square of 
the modulus of the angular momentum vector, and which we denote by L?: 


Le = f£24+624+L,2. (26.9) 


This operator commutes with each of the operators Ê., L,, L4: 


(L2, La} a 0, {L} 3} =0, {b2 l} = 0. (26.10) 


Using (26.8), we have 

Ea L} = Bite Edta es a 

= —i(f,f,+£,L,), 

(ce L3 E i(LL,+L,L,), 

a L3 =0. 
Adding these equations, we have {L?, £,} = 0. Physically, the relations 
(26.10) mean that the square of the angular momentum, 1.e. its modulus, can 
have a definite value at the same time as one of its components. 


Instead of the operators Êz, Êy it is often more convenient to use the 
complex combinations 


Lyc bytia Wax Leahy: (26.11) 


It is easily verified by direct calculation using (26.8) that the following 
commutation rules hold: 


(Ef = Qh fee Ly (26.12) 
(La L3 = -L., 
and it is also not difficult to see that 
R= £,£.4+£2-L, 
=f f,4+£2+L,. (26.13) 


Finally, we shall give some frequently used expressions for the angular 
momentum operator of a single particle in spherical polar coordinates. 
Defining the latter by means of the usual relations 


x = y sin f cos ¢, y = rsin @sin 4d, g= cos G, 


we have after a simple calculation 


Í, = Eas (26.14) 


og 


: ô ô 
eee exto( gee eat a). (26.15) 
a0 ad 


m 
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Substitution in (26.13) gives the squared angular momentum operator of the 
particle: 








f2 E A 6 i | 26.16) 
= = ee S — Š $ 
| ap : = ( 


sin?@ sin @ 26 


It should be noticed that this is, apart from a factor, the angular part of the 
Laplacian operator. 


§27. Eigenvalues of the angular momentum 


In order to determine the eigenvalues of the component, in some direction, 
of the angular momentum of a particle, it is convenient to use the expression 
for its operator in spherical polar coordinates, taking the direction in question 
as the polar axis. According to formula (26.14), the equation fs = la) can 
be written in the form 

— i Oboe = Typ. (27.1) 


Its solution is 
p = f(r, 0), 


where f(r, 0) is an arbitrary function of r and @. If the function ¢ is to be single- 
valued, it must be periodic in ¢, with period 27. Hence we findt 


l, =m, where m =0,+1,+2.,.... (27.2) 


Thus the eigenvalues /, are the positive and negative integers, including 
zero. The factor depending on ¢, which characterizes the eigenfunctions of 
the operator /,, is denoted by 


D nlp) = (2x) Verne, (27.3) 


These functions are normalized so that 


[ Dnm (H) dh = mn. (27.4) 


The eigenvalues of the z-component of the total angular momentum of the 
system are evidently also equal to the positive and negative integers: 


L, =M, where M = 0,41, 4+2,... (27.5) 


(this follows at once from the fact that the operator Êz is equal to the sum of 
the commuting operators l; for the individual particles). 

Since the direction of the z-axis is in no way distinctive, it is clear that the 
same result is obtained for Êz, Ly and in general for the component of the 
angular momentum in any direction: they can all take integral values only. 
At first sight this result may appear paradoxical, particularly if we apply 
it to two directions infinitely close to each other. In fact, however, 1t must 


+ The customary notation for the eigenvalues of the angular momentum component is m, 
which also denotes the mass of a particle, but this should not lead to any confusion. 
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be remembered that the only common eigenfunction of the operators 
Lz, L, Lz corresponds to the simultaneous values 
y P 


L, =L,y =L, =0; 


in this case the angular momentum vector is zero, and consequently so is its 
projection upon any direction. If even one of the eigenvalues L,, Ly L, 1s 
not zero, the operators L,, [,, L, have no common eigenfunctions. In other 
words, there is no state in which two or three of the angular momentum 
components in different directions simultaneously have definite values differ- 
ent from zero, so that we can say only that one of them is integral. 

The stationary states of a system which differ only in the value of M have 
the same energy; this follows from general considerations, based on the 
fact that the direction of the z-axis is in no way distinctive. Thus the 
energy levels of a system whose angular momentum is conserved (and is not 
zero) are always degenerate. +t 

Let us now look for the eigenvalues of the square of the angular momen- 
tum. We shall show how these values may be found, starting from the 
commutation rules (26.8) only. We denote by wa, the wave functions of the 
stationary states with the same value of L?, belonging to one degenerate 
energy level, and distinguished by the value of M.} 

First of all we note that, since the two directions of the z-axis are physically 
equivalent, for every possible positive value M = |M] there is a correspond- 
ing negative value M = —|M]|. Let L (a positive integer or zero) denote the 
greatest possible value of |M| for a given L2. The existence of this upper 
limit follows because L2—£,2 = Êr? + L£,? is the operator of the essentially 
positive physical quantity Lz? + L,?, and its eigenvalues therefore cannot be 
negative. 

Applying the operator £,£ 4 to the eigenfunction yy of the operator Êz 
and using the commutation rule DEM L = +f, (26.12), we obtain 


Lb du = (M1) yar. (27.6) 


Hence we see that the function Ê xy is (apart from a normalization constant) 
the eigenfunction corresponding to the value M + 1 of the quantity Lz: 


at+1 = constant x iym, pm- = constant x Lyn. (277) 


t This is a particular case of the general theorem, mentioned in §10, which states that the 
levels are degenerate when two or more conserved quantities exist whose operators do not 
commute. Here the components of the angular momentum are such quantities. 

t Here it is supposed that there is no additional degeneracy leading to the same value of 
the energy for different values of the squared angular momentum. This is true for a discrete 
spectrum (except for the case of what is called accidental degeneracy in a Coulomb field; see 
§36) and in general untrue for the energy levels of a continuous spectrum. However, even 
when such additional degeneracy is present, we can always choose the eigenfunctions so that 
they correspond to states with definite values of L?, and then we can choose from these the 
states with the same values of E and L?. This is mathematically expressed by the fact that the 
matrices of commuting operators can always be simultaneously brought into diagonal form. 
In what follows we shall, in such cases, speak, for the sake of brevity, as if there were no 
additional degeneracy, bearing in mind that the results obtained do not in fact depend on 
this assumption, by what we have just said. 


88 Angular Momentum §27 
If we put M = L in the first of these equations, we must have identically 


Lyr = 0, (27.8) 


since there is by definition no state with M > L. Applying the operator L- 
to this equation and using the relation (26.13), we obtain 


Lir = (L2-L2-L) ir = 0. 


Since, however, the hy are common eigenfunctions of the operators L? and 
L, we have 


Lp = Lyr, Leye = Dr, Lapi = Lyn, 
so that the equation found above gives 
L? = L(L+1). (27.9) 


Formula (27.9) determines the required eigenvalues of the square of the 
angular momentum; the number L takes all positive integral values, including 
zero. For a given value of L, the component L, = M of the angular momen- 
tum can take the values 


N ene PE Dey Ee Be (27.10) 


i.e. 2L41 different values in all. The energy level corresponding to the 
angular momentum L thus has (2L + 1)-fold degeneracy; this is usually 
called degeneracy with respect to the direction of the angular momentum. 
A state with angular momentum L = 0 (when all three components are also 
zero) is not degenerate. The wave function of such a state is spherically 
symmetric, as is evident from the fact that the change in it under any in- 
finitesimal rotation, given by Ly, is in this case zero. 

We shall often, for the sake of brevity, and in accordance with custom, 
speak of the “angular momentum” L of a system, understanding by this an 
angular momentum whose square is L(L + 1); the s-component of the angular 
momentum is usually called just the “angular momentum component”. 

The angular momentum of a single particle 1 is denoted by the small letter 
l, for which formula (27.9) becomes 


12 = (1+). (27.11) 


Let us calculate the matrix elements of the quantities Lz and Ly in a 
representation in which L, and L?, as well as the energy, are diagonal 
(M. Born, W. Heisenberg and P. Jordan 1926). First of all, we note that, 
since the operators Êz and Êy commute with the Hamiltonian, their matrices 
are diagonal with respect to the energy, i.e. all matrix elements for transitions 
between states of different energy (and different angular momentum L) 
are zero. Thus it is sufficient to consider the matrix elements for transitions 
within a group of states with different values of M, corresponding to a single 
degenerate energy level. 

It is seen from formulae (27.7) that, in the matrices of the operators 
Ê, and £_, only those elements are different from zero which correspond 
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to transitions M—1 > M and M + M-1 respectively. Taking this into 


account, we find the diagonal matrix elements on both sides of the equation 
(26.13), obtainingt 


L(L+1) = (M|L4|M—1)<M-—1|L_|M) + e 
Noticing that, since the operators Ê, and Ê, are Hermitian, 
(M—-1|L_|M)> = (M|L.|M—-1)*, 
we can rewrite this equation in the form 
[|<M|L4|M—1)|? = L(L4+1)\-M(M—}) 


= (L—M+1)(L+™), 
whencef 
(M|L.|M-1) = <(M-1|L_|M> 
= v[(L+M)\(L—M+})]. (27.12) 


Hence we have for the non-zero matrix elements of the quantities L, and L; 
themselves 


(M|Le|M—1) = (M-1|La|M> = $V[(L+M\L-M+1)], 
(M|\Ly|M—1) = —(M—1|Ly|My = —HyY(L+MyL-Mary SO 


The diagonal elements in the matrices of the quantities Lz and Ly are zero. 
Since a diagonal matrix element gives the mean value of the quantity in the 
corresponding state, it follows that the mean values Lz and Ly are zero in 
states having definite values of Lz. Thus, if the angular-momentum com- 
ponent in a given direction in space has a definite value, the vector L itself is 
in that direction. 


§28. Eigenfunctions of the angular momentum 


The wave function of a particle is not completely determined when the 
values of / and m are prescribed. This is seen from the fact that the expres- 
sions for the operators of these quantities in spherical polar coordinates 
contain only the angles @ and ¢, so that their eigenfunctions can contain an 
arbitrary factor depending on r. We shall here consider only the angular 
part of the wave function which characterizes the eigenfunctions of the 
angular momentum, and denote this by Yim(6, ¢), with the normalization 
condition 


f | Yiml? do Si 
where do = sin @ d@éd¢ 1s an element of solid angle. 


t In the symbols for the matrix elements, we omit for brevity all suffixes with respect to 
which they are diagonal! (including L). 

t The choice of sign in this formula corresponds to the choice of the phase factors in the 
eigenfunctions of the angular momentum. 


90 Angular Momentum §28 


We shall see that the problem of determining the cornmon eigenfunctions 
of the operators I? and Í, admits of separation of the variables @ and ¢, and 
these functions can be sought i in the form 


Yim = Pm($)Oin(9), (28.1) 


where ©,,(¢) are the eigenfunctions of the operator /,, which are given by 
formula (27.3). Since the functions Ọm are already normalized by the condi- 
tion (27.4), the ©z, must be normalized by the condition 


f |©,m{2sin@ dé = 1. (28.2) 
0 
The functions Ym with different / or m are automatically orthogonal: 
on 7 
{ Í VAY) sinb dôde = Swè mn’, (28.3) 
0 0 


as being the eigenfunctions of angular momentum operators corresponding 
to different eigenvalues. The functions Ọ„(¢) separately are themselves 
orthogonal (see (27.4)), as being the eigenfunctions of the operator Í, cor- 
responding to different eigenvalues m of this operator. The functions ©,,,(8) 
are not themselves eigenfunctions of any of the angular momentum operators; 
they are mutually orthogonal for different /, but not for different m. 

The most direct method of calculating the required functions is by directly 
solving the problem of finding the eigenfunctions of the operator 1? written 
in spherical me coordinates i Pi The equation [2y = 124 is 











1 
no) + Y EME My = 0. 


sin@ 86 sin?@ od? 


Substituting in this equation the form (28.1) for ¢%, we obtain for the function 
© ın the equation 


1 d ( i siom 

———{ sing = 

sin@ dé 
This equation is well known’in the theory of spherical harmonics. It has 
solutions satisfying the conditions of finiteness and single-valuedness for 
positive integral values of ? > |m], in agreement with the eigenvalues of the 
angular momentum obtained above by the matrix method. The correspond- 
ing solutions are what are called associated Legendre polynomials P7(cos @) 
(see §c of the Mathematical Appendices). Using the normalization condition 


(28.2), we findt 
Orn(6) = (—1)"it V[H(24 1)(I—m)!/(-+m)!]P,;7(cos6). (28.5) 


Gig t UL+ 10m = 0. (28.4) 








+ The choice of the phase factor ts not, of course, determined by the normalization condi- 
tion. The definition (28.5) used in this book ts the most natural from the viewpoint of the 
theory of addition of angular momenta. It differs by a factor 7! from the one usually adopted. 
The advantages of this choice will be clear from the footnotes in §§60, 106 and 107. 
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Here it is supposed that m > 0. For negative m, we use the definition 
©; jmp = (—1)"Opm)- (28.6) 


In other words, Orm for m < 0 is given by (28.5) with |m| instead of m and 
the factor (— 1)” omitted. 

Thus the angular momentum eigenfunctions are mathematically just 
spherical harmonic functions normalized in a particular way. For reference, 
the complete expression embodying the above definitions is 


21+1(1—|ml)! 


Yim(6, $) = (—1yem timp sie rag T 


1/2 
Si | Pimi (cos 6)eimé, (28.7) 
In particular, 


mMer JE Plos 6). (28.8) 


It is evident that the functions differing in the sign of m are related by 
( a Deen Yr, -m = Yim*. (28.9) 


For / = 0 (so that m = 0 also) the spherical harmonic function reduces to 
a constant. In other words, the wave functions of the states of a particle with 
Zero angular momentum depend only on 7, i.e. they have complete spherical 
symmetry, in agreement with the general statement in §27. 

For a given m, the values of / starting from |m| denumerate the successive 
eigenvalues of the quantity 1? in order of increasing magnitude. Hence, from 
the general theory of the zeros of eigenfunctions (§21), we can deduce that the 
function ©,,, becomes zero for 1—|m| different values of the angle 6; in 
other words, it has as noda! lines 1—|m/| ‘‘lines of latitude” on the sphere. If 
the complete angular functions are taken with the real factors cos mp or 
sin me instead oft ettim e, they have as further nodal lines |m| “Iines of longi- 
tude”; the total number of nodal lines is thus Z. 

Feb we shall show how the functions © m may be calculated by the 
matrix method. This is done similarly to the calculation of the wave func- 
tions of an oscillator in §23. We start from the equation (27.8): 


Í Yu = 0. 


Using the expression (26.15) for the operator l, and substituting 
Yu = (27)-*e#*O1(6), we obtain for ©, the equation 


d0,,/dé—1 cot Oy = 0, 


whence ©); = constant X sin!@. Determining the constant from the normali- 
zation condition, we find 


On = (—i} v[R(2/+ 1)1]2 (1/1) sinð. (28.10) 


+ Each such function corresponds to a state in which /z does not have a definite value, but 
can have the values +m with equal probability. 
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Next, using (27.12), we write 


ie eee aa a ee Kies 
= V/[(l—m)(14+m4+)] Yin. 
A repeated application of this formula gives 


VIE mU] Yim = (2AL Yn. 


The right-hand side of this equation is easily calculated by means of the 
expression (26.15) for the operator L. We have 


EL f(@)etmd] = ef(m-D¢ sinl-m6 d( f sin™6)/d(cos 6). 


A repeated application of this formula gives 
(LeO, = ef? sin-™9 d-"(sin! @.©,,)/d(cos 6)-™. 


Finally, using these relations and the expression (28.10) for ©, we obtain 
the formula 


(2141)\(l+m)q 1 dem 
Pra) = (~ aj E 2(1—m)! da e aa 


which is the same as (28.5). 


§29. Matrix elements of vectors 


Let us again consider a closed system of particles;t let f be any scalar 
physical quantity characterizing the system, and f the operator corresponding 
to this quantity. Every scalar is invariant with respect to rotation of the 
coordinate system. Hence the scalar operator f does not vary when acted 
on by a rotation operator, i.e. it commutes with a rotation operator. We know, 
however, that the operator of an infinitely small rotation is the same, apart 
from a constant factor, as the angular momentum operator, so that 


iL) =o (29.1) 


From the commutability of f with the angular momentum operator it 
follows that the matrix of f with respect to transitions between states with 
definite values of L and M is diagonal with respect to these suffixes. More- 
over, since the specification of M defines only the orientation of the system 
relative to the axes of coordinates, and the value of a scalar is independent of 
this orientation, we can say that the matrix elements <n'’'LMj| f |nLM) are 
independent of the value of M; n conventionally denotes all the quantum 
numbers other than L and M which define the state of the system. A formal 


— 


t All the results in this section are valid also for a particle in a centrally symmetric field 
(and in general whenever the total angular momentum of the system ts conserved). 
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proof of this assertion can be obtained from the commutativity of the 
operators f and L+: 


fL,-L.f = 0. (29.2) 


Let us write down the matrix element of this equation corresponding to the 
transition 7, L, M—>n', L, M+1. Taking into account the fact that the 
matrix of the quantity L+ has only elements with n, L, M > n, L, M +1, we 
obtain 


<n’, L, M+1|f\|n, L, MW+1> <n, L, M4+1L4\n, L, M> = 
<n’, L, M+ 1{L.|n’, L, My <n’, L, M| fin, L, MY, 


and since the matrix elements of the quantity L+ are independent of the suffix 
n, we find . 


in’, L, M+i|fin, L, M+1> = <n’, L, M\f in, L, MY, (29.3) 


whence it follows that all the quantities <n’, L, M| f |n, L, M) for different M 
(the other suffixes being the same) are equal. 

If we apply this result to the Hamiltonian itself, we obtain our previous 
result that the energy of the stationary states is independent of M, i.e. that 
the energy levels have (2L + 1)-fold degeneracy. 

Next, let A be some vector physical quantity characterizing a closed 
system. When the system of coordinates is rotated (and, in particular, in 
an infinitely small rotation, 1.e, when the angular momentum operator 
is applied), the components of a vector are transformed into linear functions 
of one another. Hence, as a result of the commutation of the operators L 
with the operators A;, we must again obtain components of the same vector, 
Ai. The exact form can be found by noticing that, in the particular case 
where A is the radius vector of the particle, the formulae (26.4) must be 
obtained. Thus we find the commutation rules 


{Ls Ar} = teri. (29.4) 


These relations enable us to obtain several results concerning the form 
of the matrices of the components of the vector A (M. Born, W. Heisenberg 
and P. Jordan 1926). First of all, it is possible to derive selection rules which 
determine the transitions for which the matrix elements can be different 
from zero. We shall not go through the fairly lengthy calculations here, 
however, since it will appear later ($107) that these rules are actually a direct 
consequence of the general transformation properties of vector quantities 
and can be derived from the latter with hardly any calculation at all. Here 
we shall merely give the rules, without proof. 
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The matrix elements of all the components of a vector can be different 
from zero only for transitions in which the angular momentum L changes by 
not more than one unit: 


L>L or L+l. (29.5) 


There is a further selection rule which forbids transitions between any two 
states with L = 0. This rule is an obvious consequence of the complete 
spherical symmetry of states with angular momentum zero. 

The selection rules for the angular momentum component M are different 
for the different components of a vector: the matrix elements can be different 
from zero for transitions where M changes as follows: 


for <A, = Az+iAy,, M—>M+41, 
for A- = Az—IAy,, M > M-1, (29.6) 
for Az, M — AT. 


Moreover, it is possible to determine a general form for the matrix elements 
of a vector as functions of the number M. These important and frequently 
used formulae are given here, also without proof, since they are actually 
a particular case of more general relations derived in §107 for any tensor 


quantities. 
The non-zero matrix elements of the quantity A; are given by the formulae 
M 
n’LM|A,|nLM> = —————_——— <n L||Al|nL), 
¢n'LM|A,\n, L—1, MY | Selek <n'L\|Al|n, L—1), } (29.7) 
! -1, M); = j-_ n, L—1), ; 
l ae LOY ICL) 


| L? — M? 


j -1 M = a a 
<n’, L-1, M\Az|nLM) JLL- ene) 


<n’, L—1||Aj|nL>. 


Here the symbol <n'L'||AljnLy denotes a reduced matrix element, a quantity 
independent of the quantum number M.t These matrix elements are related 
by 


(n'L'\|Al|nL) = <nL\|Al|n'L'>*, (29.8) 


which follows directly from the fact that the operator A, is Hermitian. 
The matrix elements of the quantities A- and A; are also determined by 


+ The appearance in formulae (29.7) and (29.9) of denominators which depend on L is in 
accordance with the general notation used in §107. The convenience of these denominators 
is shown, in particular, by the simple form of equation (29.12) for the matrix elements of the 
scalar product of two vectors. 

The symbol for the reduced matrix element is to be taken as a whole, in contrast to the 
matrix element symbol (see the comments following (11.17)). 
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the reduced matrix elements. The non-zero matrix elements of A- are 
<n’, L, M—1|A-|nLM) 


= [EE cn’ LAlaL), 
E DELFI) 


n L, M—1\A_|n, L—1, MD 


£ Jp en LjAlan L-1 t 29 
LOB- 2b4+1) 


in’, L-1, M—1|A_|nLM) 


= gts. asi <n’, L—1||Al|nL). 
L(2L —1)(2L +1) 


The matrix elements of A + need not be written out separately: since A; and 
Ay are real we have 


<n'L'M’|A4|nLM)> = (nLM|A-_|n'L'M’)*. (29.10) 
There is a formula which expresses the matrix elements of the scalar A. B 
in terms of the reduced matrix elements of the two vector quantities A and 


B. The calculation is conveniently carried out by writing the operator 
A . Bin the form 


A.B 


YAB- tA Borb (29.11) 


The matrix of A.B (like that of any scalar) is diagonal with respect to L 
and M. A calculation by means of formulae (29.7)-(29.9) gives the result 


1 
(n' LM|A.BInLM) = > e'Ll|Aljn"L"<n"L"||B||nL), (29.12) 
n L” 


2L +1 


where L” takes the values L, L + 1. 
For reference, we shall give the reduced matrix elements for the vector L 
itself. A comparison of (29.9) and (27.12) shows that 


CLIL|L> = vIL(L + 12h + 1)), 


} (29.13) 
(L-AlL||L> = <LI|LI|L—1> = 0. 


A quantity that often occurs in applications is the unit vector n along the 
radius vector of the particle. Its reduced matrix elements can be calculated 
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by finding, for example, the matrix elements of mz = cos @ for a zero angular- 
momentum component, m = 0; 


1-1, Ojnz|l0> = Í ©r-1.0* cos 8.O sin 648, 
0 


with the functions © 9 given by (28.11). The evaluation of the integralt gives 
<l—1, O|nz\10> = al/4/[(21— 1)(27 + 1)]. 


The matrix elements for transitions / —> lare zero (as for any polar vector of 
an individual particle; see (30.8) below). Comparison with (29.7) then gives 


l— lji = —Cl\|n|[l—-1> = ivi, 
<l— liini) Cilla- 1> = iv } (29.14) 
<lin = 0. 


PROBLEM 
Average the tensor nink —4 ôix (where n is a unit vector along the radius vector of a particle) 
over a state where the magnitude but not the direction of the vector I is given (1.e. /z is 
indeterminate). 


SoLution., The required mean value is an operator which can be expressed in terms of 
the operator I alone. We seek it in the form 


mtg ~ bôi = afl + hbi—3Sall+1)); 


this is the most general symmetrical tensi: of rank two with zero trace that can be formed 
from the components of 1. To determine the constant a we multiply this equation on the left 
by Js and on the right by fe (summing over i and k). Since the vector n is perpendicular to 
the vector Al = fx fp, we have nit = 0. The product [ifidif, = (1°)? is replaced by its 
eigenvalue /*(1+1)*, and the product Libelily is transformed by means of the commutation 
relations (26.7) as follows: 


Liblib = [didele—ievcibibls 
= (12)? diel — hh) 
= (12)? + heixieiwmlilm 
= (122-12 
= [2(1+1)?~1(+1) 


(using the fact that eiktemkı = 251m). After a simple reduction we obtain the result 


a = —1/(21—1)(21+3). 


$30. Parity of a state 
Besides the parallel displacements and rotations of the coordinate system, 
the invariance under which represents the homogeneity and isotropy of space 


t By /—1 times integrating by parts with d cos 6; the general formula for integrals of this 
type is (107.14). 
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respectively, there is another transformation which leaves unaltered the 
Hamiltonian of a closed system. This is what is called the inversion trans- 
formation, which consists in simultaneously changing the sign of all the 
coordinates, 1.e. a reversal of the direction of each coordinate axis; a right- 
handed coordinate system then becomes left-handed, and vice versa. The 
invariance of the Hamiltonian under this transformation expresses the 
symmetry of space under mirror reflections.t In classical mechanics, the 
invariance of Hamilton’s function with respect to inversion does not lead to a 
conservation law, but the situation is different in quantum mechanics. 

Let us denote by Ê (for ‘‘parity’’) an inversion operator whose effect on a 
wave function i(r) is to change the sign of the coordinates: 


P(r) = p-r). (30.1) 


It is easy to find the eigenvalues P of this operator, which are determined 
by the equation 


P(r) = Pur). (30.2) 


To do so, we notice that a double application of the inversion operator 
amounts to identity: the argument of the function is unchanged. In other 
words, we have P?ẹ = P2ys = , 1.e. P? = 1, whence 


P= +1. (30.3) 


Thus the eigenfunctions of the inversion operator are either unchanged or 
change in sign when acted upon by this operator. In the first case, the wave 
function (and the corresponding state) is said to be even, and in the second it 
is said to be odd. 

The invariance of the Hamiltonian under inversion (i.e. the fact that the 
operators H and Ê commute) thus expresses the law of conservation of parity: 
if the state of a closed system has a definite parity (i.e. if it is even, or odd) 
then this parity is conserved in the course of time. 

The angular momentum operator also is invariant under inversion, which 
changes the sign of the coordinates and of the operators of differentiation with 
respect to them; the operator (26.2) thus remains unaltered. In other words, 
the inversion operator commutes with the angular momentum operator, and 
this means that the system can have a definite parity simultaneously with 
definite values of the angular momentum L and its component M. All states 
that differ only in the value of .W have the same parity; this is evident because 
the properties of a closed system are independent of its orientation in space, 


t Invariance under inversion exists also for the Hamiltonian of a svstem of particles in a 
centrally symmetric feld with the centre at the origin. 

t To avoid misunderstanding, it should be mentioned that this refers to the non-relativistic 
theory. There exist interactions in Nature, falling in the realm of relativistic theory, which 
violate the conservation of parity 
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and it can be formally demonstrated from the commutation rule L ,.P-—- PL, 
= 0 by the same method as in deriving (29.3) from (29.2). 

There are specific parity selection rules for the matrix elements of various 
physical quantities. Let us first consider scalars. Here we must distinguish 
true scalars, which are unchanged by inversion, from pseudoscalars, which 
change sign, for instance the scalar product of an axial and a polar vector. 
The operator of a true scalar f commutes with P; hence it follows that, if the 
matrix of P is diagonal, then the matrix of f is diagonal also as regards the 
parity suffix, i.e. the matrix elements are zero except for transitions g > g and 
u >u (where g and u denote even and odd states respectively). For the 
operator of a pseudoscalar quantity, we have Pf = — fP; the operators Pand ` 
f anticommute. The matrix element of this equation for a transition g —> g is 
Poofag = —fogPgg, and so fgg = O since Pog = 1. Similarly we find that 
fuu = 0. Thus, in the matrix of a pseudoscalar quantity, only those elements 
can be different from zero which correspond to transitions with change of 
parity. The selection rules for the matrix elements of scalars are therefore: 


true scalars g >g, u >u; 


} (30.4) 


pseudoscalars g > u, u >g. 


These rules can also be obtained directly from the definition of the matrix 
elements. Let us consider, for example, the integral fug = f Yu*fpo dq, 
where the function wy is even and Yu odd. When all the coordinates change 
sign, the integrand does so if fis a true scalar; on the other hand, the integral 
taken over all space cannot change when the variables of integration are 
renamed. Hence it follows that fug = —/fug, i.e. fug = 0 

We can similarly derive selection rules for vector quantities. Here it must 
be recalled that ordinary (polar) vectors change sign on inversion, while 
axial vectors (such as the angular momentum vector, which is the vector 
product of the two polar vectors p and r) are unchanged by inversion. The 
selection rules are found to be: 


polar vectors g >u, u >g; 


; (30.5) 


axial vectors g >p, u >u. 


Let us determine the parity of the state of a single particle with angular 
momentum /, The inversion transformation (x > —x,y > —y, zZ > —2) 
is, in spherical polar coordinates, the transformation 


r—>r, 8 >n—b, d>74+¢. (30.6) 


The dependence of the wave function of the particle on the angle is given by 
the spherical harmonic Yim, which, apart from a constant that is here 
unimportant, has the form P;™(cos 6)e*¢. When ¢ is replaced by 7+¢, 
the factor etme is multiplied by (— 1)”, and when @ is replaced by 7— 6, 
P”(cos 0) becomes P”(—cos 6) = (—1)!-™P);™(cos 0). Thus the whole 
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function is multiplied by (—1)! (independent of m, in agreement with what 
was said above), i.e. the parity of a state with a given value of / is 


|e ee ED (30.7) 


We see that all states with even / are even, and all those with ode / are odd. 

A vector physical quantity relating to an individual particle can have non- 
zero matrix elements only for transitions with / > / or /41 (§29). Remem- 
bering this, and comparing formula (30.7) with what was said above regarding 
the change of parity in the matrix elements of vectors, we reach the result. 
that the matrix elements of vectors relating to an individual particle are zero 
except for the transitions: 
polar vectors /+>1l+1; (30.8) 


axial vectors EER 


§31. Addition of angular momenta 


Let us consider a system composed of two parts whose interaction is weak. 
If the interaction is entirely neglected, then for each part the law of conserva- 
tion of angular momentum holds. The angular momentum L of the whole 
system can be regarded as the sum of the angular momenta L and Le of its 
parts. In the next approximation, when the weak interaction is taken into 
account, Lı and Lg are not exactly conserved, but the numbers Lı and Le 
which determine their squares remain “good” quantum numbers suitable for 
an approximate description of the state of the system. Regarding the angular 
momenta in a classical manner, we can say that in this approximation L, and 
Lz rotate round the direction of L while remaining unchanged in magnitude. 

For such systems the question arises regarding the “law of addition” of 
angular momenta: what are the possible values of L for given values of Lı 
and L2? The law of addition for the components of angular momentum is 
evident: since L, = L,,+ Laz, it follows that 


M = Wh + Mo. (31.1) 


There is no such simple relation for the operators of the squared angular 
momenta, however, and to derive their ‘‘law of addition” we reason as 
follows. 

If we take the quantities L,?, L,?, Liz Loz as a complete set of physi- 
cal quantities,t every state will be determined by the values of the numbers 
Li, La, Mis, M,. For given L and L, the numbers M, and M, take (20, +1) 
and (2£,+1) different values respectively, so that there are altogether 
(2L,+1)(2L,+1) different states with the same L and Lẹ. We denote the 
wave functions of the states for this representation by ¢7, 744,14. 

Instead of the above four quantities, we can take the four quantities 
L,?, L2?, L?, Lz as a complete set. Then every state is characterized by 


+ Together with such other quantities as form a complete set when combined with these 
tour. These other quantities play no part in the subsequent discussion, and for brevity we 
shall ignore them entirely, and conventionally call the above four quantities a complete set. 
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the values of the numbers L,, La, L, M (we denote the corresponding wave 
functions by %z 77). For given L, and L, there must of course be 
(2L,+1)(2Z,+1) different states as before, i.e. for given L and L, the pair 
of numbers L and M must take (2L,+1)(2L,+1) pairs of values. These 
values can be determined as follows. 

By adding the various possible values of Mı and Mp, we get the corres- 
ponding values ot M, as shown below: 





L, L, L,+L, 
E; L,—1 L,+L.—-1 
L= is capa 

L: L,—2 

L,-1 L,—1 L,+L,—2 
L,-2 Es 


We see that the greatest possible value of Mis M = Li + Le, corresponding 
to one state ¢ (one pair of values of Mı and M2). The greatest possible value 
of M in the states p, and hence the greatest possible value of L also, is there- 
fore £1+L2. Next, there are two states ġ with M = L,+L2—1. Con- 
sequently, there must also be two states # with this value of 17; one of them 
is the state with L = L,+L2(and M = L—1), andthe otheristhat with L = 
Lı+L—1 (and M = L). For the value M = Li+ L-2 there are three 
different states ¢. This means that, besides the values L = Li+ Le, L = 
Li+ L-1, the value L = Li+ L-2 can occur. 

The argument can be continued in this way so long as a decrease of M by 
1 increases by 1 the number of states with a given M. It is easily seen that 
this is so until M reaches the value |L —L2|. When M decreases further, the 
number of states no longer increases, remaining equal to 2L2+1 (if Le < Li). 
Thus |L — Lol is the least possible value of L, and we arrive at the result 
that, for given Lı and Le, the number L can take the values 


AEE E Y E E E EA (31.2) 


that is 2L,+1 different values altogether (supposing that L < L). It is 
easy to verify that we do in fact obtain (2Z,+1)(2£,41) different values of 
the pair of numbers M, L. Here it is important to note that, if we ignore 
the 2L+1 values of M for a given L, then only one state will correspond to 
each of the possible values (31.2) of L. 

This result can be illustrated by means of what is called the vector model. 
If we take two vectors L,, L, of lengths L, and L,, then the values of Z are 
represented by the integral lengths of the vectors L which are obtained by 
vector addition of L, and L}; the greatest value of L is L +L, which is 
obtained when L, and L, are parallel, and the least value is |Z,—Z,|, when 
Lı and L, are antiparallel. 

In states with definite values of the angular momenta L,, L, and of the 
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total angular momentum L, the scalar products L . La, L . L} and L . L, also 
have definite values. These values are easily found. To calculate L . L, 
we write L = L, +L, or, squaring and transposing, 


n a Bae ee 


Replacing the operators on the right-hand side of this equation by their 
eigenvalues, we obtain the eigenvalue of the operator on the left-hand side: 


L,.L, = XL(L+1)—L,(L,+1)—L,(L,+ 1)}. (31.3) 
Similarly we find 
Leb se PRL. (31.4) 


Let us now determine the ‘‘addition rule for parities”. The wave “unction 
Y of a system consisting of two independent parts is the product of the wave 
functions Yı and Y2 of these parts. Hence it is clear that if the latter are of 
the same parity (1.e. both change sign, or both do not change sign, when the 
sign of all the coordinates is reversed), then the wave function of the whole 
system is even. On the other hand, if Yı and ‘V2 are of opposite parity, then 
the function ¥ is odd. These statements may be written 


P & PP, (31.5) 


where P is the parity of the whole system and P1, P2 those of its parts. This 
rule can, of course, be generalized at once to the case of a system composed of 
any number of non-interacting parts. 

In particular, if we are concerned with a system of particles in a centrally 
symmetric field (the mutual interaction of the particles being supposed weak), 
then the parity of the state of the whole system is given by 


P=(- ])tlet... : (31.6) 


see (30.7). We emphasize that the exponent here contains the algebraic sum 
of the angular momenta /;, and this is not in general the same as their “‘vector 
sum”, 1.e. the angular momentum L of the system. 

If a closed system disintegrates (under the action of internal forces), the 
total angular momentum and parity must be conserved. This circumstance 
may render it impossible for a system to disintegrate, even if this is energetic- 
ally possible. 

For instance, let us consider an atom in an even state with angular momen- 
tum L = 0, which is able, so far as energy considerations go, to disintegrate 
into a free electron and an ion in an odd state with the same angular momen- 
tum L = 0. It is easy to see that in fact no such disintegration can occur 
(it is, as we say, forbidden). For, by virtue of the law of conservation of angu- 
lar momentum, the free electron would also have to have zero angular momen- 
tum, and therefore be in an even state (P = (— 1)? = +1); the state of the 
system ion +electron would then be odd, however, whereas the original state 
of the atom was even. 


CHAPTER V 


MOTION IN A 
CENTRALLY SYMMETRIC FIELD 


§32. Motion in a centrally symmetric field 


THE problem of the motion of two interacting particles can be reduced in 
quantum mechanics to that of one particle, as can be done in classical mech- 
anics. The Hamiltonian of the two particles (of masses mj, me) interacting in 
accordance with the law U(r) (where r is the distance between the particles) 
is of the form 


h? h? 
H = ——A,——A,+ U(r), (32.1) 
2m, 2m, 


where A, and Ag are the Laplacian operators with respect to the coordinates 
of the particles. Instead of the radius vectors r, and r, of the particles, we 
introduce new variables R and r: 


r =r r, R = (mr +m) (m4m); (32.2) 


r is the vector of the distance between the particles, and R the radius vector 
of their centre of mass. A simple calculation gives 


h? aan 
SS ef Ma U(r), 32.3 
2(m,+me) i 2m TEN ( ) 


where Ôp and A are the Laplacian operators with respect to the components 
of the vectors R and r respectively, m, +m, is thetotal mass of the system, and 
m = mymo/(m,+me) is the reduced mass. ‘Thus the Hamiltonian falls into 
the sum of two independent parts. Hence we can look for (rj, r2) in the 
form of a product ¢(R)¢(r), where the function ¢(R) describes the motion 
of the centre of mass (as a free particle of mass mı + m2), and (r) describes 
the relative motion of the particles (as a particle of mass m moving in the cen- 
trally symmetric field U(r)). 

Schrédinger’s equation for the motion of a particle in a centrally sym- 
metric field is 


A+ (2m/h®)[E— U(r) = 0. (32.4) 


Using the familiar expression for the Laplacian operator in spherical polar 
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coordinates, we can write this equation in the form 


1 8a IE 7 r 
7? ôr PEH r? Lsing TN Tinto sin? @ ar i E A 
(32.5) 





If we introduce here the operator (26.16) of the squared angular momentum 
we obtain 

h? fs 

a at H) 4g | + UNH = Ey. (32.6) 

2mL 72 ar 

The angular momentum is conserved during motion in a centrally sym- 

metric field. We shall consider stationary states in which the angular 
momentum / and the component m have definite values. These values 
determine the angular dependence of the wave functions. We thus seek 
solutions of equation (32.6) in the form 


Y = R(r)Yim(, $), (32.7) 


where the Yım(0, $) are spherical harmonic functions. 
Since ÊY n = (1+1)Y m, we obtain for the radial function R(r) the equa- 
tion 





1d dry Ki+t) , 
r-— 


r2 dr dr r? 


+z- U(r)]R = 0. (32.8) 


This equation does not contain the value of lz = m at all, in accordance with 
the (2/+ 1)-fold degeneracy of the levels with respect to the directions of the 
angular momentum, with which we are already familiar. 

Let us investigate the radial part of the wave functions. By the substitu- 
tion 





R(t) = x(r)ir (32.9) 
equation (32.8) is Ae to the form 
(1+1 
x [pe an al ls 2.6: (32.10) 


If the potential energy U(r) is everywhere finite, the wave function % must 
also be finite in all space, including the origin, and consequently so must its 


t If we introduce the operator of the radial component py of the linear momentum, in the 
form 


y ele re ame 
i= Say = ~in(—+—\a, 
ror Or r 
the Hamiltonian can be written in the form 
Ê = (1/2m)(p,2-+ 22/7) + U(r), 


which is the same in form as the classical Hamilton’s function ın spherical polar coordinates. 
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radial part R(r). Hence it follows that y(r) must vanish for r = 0: 


x(0) = 0. (32.11) 


This condition actually holds also (see §35) for a field which becomes infinite 
as r > 0. 

Equation (32.10) is formally identical with Schrédinger’s equation for 
one-dimensional motion in a field of potential energy 


he I(i+1 
Ui) =U) 





; (32.12) 
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which is the sum of the energy U(r) and a term 
hel(14-1)/2mr? = h?1?/2mr?, 


which may be called the centrifugal energy. Thus the problem of motion in a 
centrally symmetric field reduces to that of one-dimensional motion in a 
region bounded on one side (the boundary condition for r = 0). The nor- 
malization condition for the function y is also “one-dimensional”, being 
determined by the integral 


te 2) [s 


f IR|?r? dr = Í |x|? dr. 
0 0 


In one-dimensional motion in a region bounded on one side, the energy 
levels are not degenerate (§21). Hence we can say that, if the energy is given, 
the solution of equation (32.10), i.e. the radial part of the wave function, is 
completely determined. Bearing in mind also that the angular part of the 
wave function is completely determined by the values of ? and m, we reach 
the conclusion that, for motion in a centrally symmetric field, the wave func- 
tion is completely determined by the values of E, land m. In other words, 
the energy, the squared angular momentum and the z-component of the 
angular momentum together form a complete set of physical quantities for 
such a motion. 

The reduction of the problem of motion in a centrally symmetric field to a 
one-dimensional problem enables us to apply the oscillation theorem (see 
§21). We arrange the eigenvalues of the energy (discrete spectrum) for a 
given lin order of increasing magnitude, and give them numbersm,, the lowest 
level being given the number nr = 0. Then m, determines the number of 
nodes of the radial part of the wave function for finite values of r (excluding 
the point r = 0). The number n, is called the radial quantum number. The 
number / for motion in a centrally symmetric field is sometimes called the 
azimuthal quantum number, and m the magnetic quantum number. 

There is an accepted notation for states with various values of the angular 
momentum / of the particle: they are denoted by Latin letters, as follows: 


1=01234567.. 
spdfghik... 


(32.13) 
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The normal state of a particle moving in a centrally symmetric field is 
always the s state; for, if Z Æ 0, the angular part of the wave function in- 
variably has nodes, whereas the wave function of the normal state can have 
no nodes. We can also say that the least possible eigenvalue of the energy, 
for a given /, increases with /. This follows from the fact that the presence 
of an angular momentum involves the addition of the essentially positive 
term f?/(1+-1)/2mr?, which increases with /, to the Hamiltonian. 

Let us determine the form of the radial function near the origin. Here 
we Shall suppose that 


lim U(r)? = 0. (32.14) 


We seek R(r) in the form of a power series in 7, retaining only the first term 
of the series for small r; in other words, we seek R(r) in the form R = con- 
stant x7*. Substituting this in the equation 


d(r? dR/dr)/dr—I(1+1)R = 0, 


which is obtained from (32.8) by multiplying by r? and taking the limit as 
r > 0, we find 
s(s+1) = (+1). 
Hence 
s=] or s = —(/+1). 


The solution with s = —(/+1) does not satisfy the necessary conditions; 
it becomes infinite for r = 0 (we recall that Z > 0). Thus the solution with 
s = l remains, i.e. near the origin the wave functions of states with a given Z 
are proportional to 7': 


Rı =~ constant xrl. (32.15) 


The probability of a particle’s being at a distance between 7 and r-+dr from 
the centre is determined by the value of r?|R|? and is thus proportional to 
y2(t+1) We see that it becomes zero at the origin the more rapidly, the 
greater the value of l. 


§33. Spherical waves 
The plane wave 


Wp = constant xe@/™p.r 


describes a stationary state in which a free particle has a definite momentum p 
(and energy E = p?/2m). Let us now consider stationary states of a free 
particle in which it has a definite value, not only of the energy, but also of the 
absolute value and component of the angular momentum. Instead of the 
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energy, it is convenient to introduce the wave number 
k =plh = V(2mE)/h. (33.1) 


The wave function of a state with angular momentum l and projection 
thereof m has the form 





Prim = Ril) Yim(8, $), (33.2) 
where the radial function is determined by the equation 
2 (i+) 
Ra + Rat | He z |Ra =0 (33.3) 


(equation (32.8) with U(r) = 0). The wave functions prim for the continuous 
(with respect to k) spectrum satisfy the conditions of normalization and 
orthogonality: 





T 


Í Pr'tem*WDkim dV = 818mm 8( = *). 


The orthogonality for different /, l' and m, m’ is ensured by the angular func- 
tions. The radial functions must be normalized by the condition 

r k'—k l 

[PReaRu dr = (=) = 2n8(k’ —k). (33.4) 

TT 

if] 
If we normalize the wave functions, not on the “‘k/27 scale”, but on the 
“energy scale”, i.e. by the condition 

f ReRe dr = 8(E'—E), 


then, by the general formula (5.14), we have 





Ret = Ra v(1/27)v(dk/dE) = (1/h)V(m/2n0k)Ri (33.5) 
For l = 0, equation (33.3) can be written 
d?(7Rro) 
dz +R R io = 0; 


its solution finite for r = 0 and normalized by the condition (33.4) is (cf. 


(21.9)) 
Rei (33.6) 





To solve equation (33.3) with Z # 0, we make the substitution 
Ry = Xia (33.7) 
For xx we have the equation 


Xm + 21+ 1)xr r+ key = 0. 
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If we differentiate this equation with respect to 7, we obtain 


A ma EE an 
Xkt +> Jw =0 = 








By the substitution x,;' = 7x, 41 It becomes 


2(1+-2) 





Xrti + Xk, +1 "+k? Xk = 0, 


which is in fact the equation satisfied by x, ;,;. Thus the successive func- 
tions xx; are related by 


Xntty = Xu İf, (3 3.8) 


(- d\! 
Xk = š ay X ko» 


and hence 


where xko = R pgo 18 determined by formula (33.6) (this expression can, of 
course, be multiplied by an arbitrary constant). 

Thus we finally have the following expression for the radial functions in 
the free motion of a particle: 





(33.9) 


‚2r! 1 d\! sinkr 
Ra =(= =(- 


kt \r dr r 
(the factor k~! is introduced for normalization purposes—see below— and 
the factor ( — 1)! for convenience). The functions (33.9) can be expressed in 
terms of Bessel functions of half-integral order, in the form 


Rp = v (2rkir) Ji-i(kr) = 2kj(kr); (33.10) 
the functions 


Jx) = vri 2x) Ji +(x) (33.11) 


are called spherical Bessel functions.T 

To obtain an asymptotic expression for the radial function (33.9) at large 
distances, we notice that the term which decreases least rapidly as r > œ is 
obtained by differentiating the sine / times. Since each differentiation — d/dr 
of the sine adds — $7 to its argument, we have the following asymptotic 
expression: 


È 2 sin (kr — alm) 





Rit (33.12) 
r 
t The first few of these are 
sin x snx cosx 3 | 3 cos x 
jo = — j= 3 - —, p= —— jSsin x — ae 8 
x x x x x 


Functions defined as x times these are also sometimes used 
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The normalization of the functions Rx; can be effected by means of their 
asymptotic expressions, as was explained in §21. Comparing the asymptotic 
formula (33.12) with the normalized function Ryo (33.6), we see that the func- 
tions Rx, with the coefficient used in (33.9), are in fact normalized as they 
should be. 

Near the origin (7 small) we have, expanding sin kr in series and retaining 
only the term containing the lowest power of 7 after the differentiation,t 





e d \' sin kr 


r dr r 





R T buaka 
mh Ges (21-41)! 


=( — IŅk2 +1 (21+ 1)!!. 
Thus the functions Rx near the origin have the form 


+1 
Br a (33.13) 


(2141)! 


in agreement with the general result (32.15). 

In some problems (of scattering theory) it is necessary to consider wave 
functions which do not satisfy the usual conditions of finiteness, but corres- 
pond to a flux of particles from or to the centre. The wave function which 
describes such a flux of particles with angular momentum / = 0 is obtained 
by taking, instead of the ‘‘stationary spherical wave” (33.6), a solution in the 
form of an outgoing spherical wave Rxo* or an ingoing spherical wave Rxo7, 
with 

Reo* = (Alrje**. (33.14) 


In the general case of an angular momentum / which is not zero, we obtain 
a solution of equation (33.3) in the form 





= 
wt =(— was(-+ (33.15) 
rdr/ r 
These functions can be expressed in terms of Hankel functions: 
Ru* = +iAv(kn|2r)H p (kr), (33.16) 
+1 


of the first and second kinds for the signs + and — respectively. 


t The symboi !! denotes the product of all integers of the same parity up to and including 
the number in question. 
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The asymptotic expression for these functions 1s 


Rat œ Aetter—tr/2)/y, (33:17) 
Near the origin, it has the form 
21—1)!! 
Ryt ~ a, (3 3.18) 


We normalize these functions so that they correspond to the emission (or 
absorption) of one particle per unit time. To do so, we notice that, at large 
distances, the spherical wave can be regarded as plane in any small interval, 
and the current density initisj = vpy*, where v = kh/mis the velocity of a 
particle. The normalization is determined by the condition $j df = 1, 
where the integration is carried out over a spherical surface of large radius 7, 
i.e. fjr? do = 1, where do is an element of solid angle. If the angular func- 
tions are normalized as before, the coefficient A in the radial function must 
be put equal to 


A =1/vv = v(mjkħ). (33.19) 


An asymptotic expression similar to (33.12) holds, not only for the radial 
part of the wave function of free motion, but also for motion (with positive 
energy) in any field which falls off sufficiently rapidly with distance.t At 
large distances we can neglect both the field and the centrifugal energy in 
Schrédinger’s equation, and there remains the approximate equation 


1 d?(rRj) 


2 = 
-— 5 HER = 0. 
The general solution of this equation is 


2 sin(kr — ġir + ô) 


rT 


Rr ~ (33.20) 


where &, is a constant, called the phase shift, and the common factor is chosen 
in accordance with the normalization of the wave function on the “k/27 
scale”.t The constant phase shift ô is determined by the boundary con- 
dition (Rx is finite as 7 > 0); to do this, the exact Schrédinger’s equation 
must be solved, and 8; cannot be calculated in a general form. The phase 
shifts 5; are, of course, functions of both / and k, and are an important 
property of the eigenfunctions of the continuous spectrum. 


+ As we shall show in §124, the field must decrease more rapidly than 1/r. 

t The term —Hz7 in the argument of the sine is added so that 6; = 0 when the field is 
absent. Since the sign of the wave function as a whole is not significant, the phase shifts 8: 
are determined to within nm (not 2nm). Their values may therefore always be chosen in the 
range between 0 and v. 
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PROBLEMS 


PROBLEM 1. Determine the energy levels for the motion of a particle with angular momen- 
tum / = 0 in a spherical square potential well: 


U(r) = —U, for r <a, U(r) = 0 forr >a, 


SoLution. For / = 0 the wave functions depend only onr. Inside the well, Schrédinger's 
equation has the form 


1 d? l 
soa bey =0, k= pet IED): 


The solution finite for r = 0 is 


H 





The solution vanishing at infinity ie 
p = A'e ir. 
The condition of the continuity of the logarithmic derivative of ry at r = a gives 


k cot ka = —x = —¥{(2mU,/h?)—F*}, (1) 


or 
sinka = + /(h?/2ma*U,)ka. (2) 


This equation determines in implicit form the required energy levels (we must take those 
roots of the equation for which cot ka < 0, as follows from (1)). The first of these levels 
(with l = 0) is at the same time the deepest of all energy levels whatsoever, i.e. it corresponds 
to the normal state of the particle. 

If the depth U, of the potential well is small enough, there are no levels of negative energy, 
and the particle cannot “‘stay’’ in the well. This is easily seen from equation (2), by means of 
the following graphical construction. The roots of an equation of the form +sin x = a 
are given by the points of intersection of the line y = ax with the curves y = +sin x, and 
we must take only those points of intersection for which cot x < 0; the corresponding parts 
of the curve y = sin x are shown in Fig. 9 by a continuous line. We see that, if g is 
sufficiently large (U, small), there are no such points of intersection. The first such point 
appears when the line y = gx occupies the position shown, i.e. for « = 2/7, and is at x = 47. 





`N finx N / 


Fic. 9 


Putting « = h/4/(2ma?U,), x = ka, we hence obtain for the minimum well depth to give a 
single negative level 
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This quantity increases with decreasing well radius a. The position of the first level 
E, at the point where it first appears is determined from ka = $r and is E, = 0, as we should 
expect. As the well depth increases further, the normal level E, descends. When the difference 


A = (Uo/ Uo min) — 1 ts small, 
-E1 = (777/16) U 0, min A?. (4) 


PROBLEM 2. Determine the order of the energy levels with various values of the angular 
momentum / in a very deep spherical potential well (U, > f?/ma*) (W. Elsasser 1933). 


SOLUTION. The condition at the boundary of the well requires that yy > 0 as Up > œ 
(see §22). Writing the radial wave function within the well in the form (33.10), we thus have 
the equation 

Jizrso(ka) = 0, 


whose roots give the position of the levels above the bottom of the well (U,)—|E| = f?k?/2m) 
for various values of /. The order of the levels from-the ground state 1s found to be 


Is, 1p, ld, 2s, 1f, 2p, 1g, 2d, 1h, 35, 2f, o.. 


The numbers preceding the letters give the sequence of levels for each /.f 
PROBLEM 3. Determine the order of appearance of levels with various J as the depth 
Uo of the well increases. 


SoLUTION. When it first appears, each new level has energy E = 0. The corresponding 
wave function in the region outside the well, which vanishes as r — ©, is 


R = constant x r~i 


(the solution of equation (33.3) with k = 0). From the continuity of Rı and Rr at the boundary 
of the well it follows, in particular, that the derivative (r'+'R,)’ is continuous, and so we have 
the following condition for the wave function within the well: 


(rR) =0 for r=a. 


This is equivalentt to the condition for the function Ri_, to vanish and, from (33.10), we 
obtain the equation 


Ferfay(2mlo)h) = 0; 


for l = 0 the function Ji-1/2 must be replaced by the cosine. This gives the following order 
of appearance of new levels as Uo increases: 


Leal ldr 2s Vf Sp. 1g. 2d, 3s ih; 2f 5.2 


It may be noted that differences from the order of ievels in a deep well occur only for compar- 
atively high levels. 


PROBLEM 4. Determine the energy levels of a three-dimensional oscillator (a particle in a 
field U = }yw*r?), their degrees of degeneracy, and the possible values of the orbital angular 
momentum in the corresponding stationary states. 

SOLUTION. Schrédinger’s equation for a particle in a field U = bypw(x?+y?+2?) allows 
separation of the variahles, leading to three equations like that of a linear oscillator. The 
energy levels are therefore 


E, = ofn tn tnt |) = heat i). 


The degree of degeneracy of the nth level is equal to the number of ways in which n can be 
divided into the sum of three positive integral (or zero) numbers; || this is 


X(n+1)(n+2). 


t This notation is customary for particle levels in the nucleus (see §118). 

t According to (33.7) and (33.8) we have (r-'Ri)’ = r~'Ri,,. Since the equation (33.3) 
is unaltered when / is replaced by —/—1, we also have (r'+!R_1_,)’ = r'+1R_i. Finally, since 
the functions R_; and Ri_, satisfy the same equation, we obtain (r! +R) = r'*+4Ri_), the 
formula used in the text. 

|| In other words, this is the number of ways in which n similar balls can be distributed 
among three urns. 
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The wave functions of the stationary states are 
dnnn, = constant x e-e" 2H p (ax)H» (ay) Hn (a2), (1) 


where o = +/(mw/h) and m is the mass of the particle. When the sign of the coordinate 
is changed, the polynomial Hn is multiplied by (—1)". The parity of the function (1) is 
therefore (—1)%+%*" = (—1)". Taking linear combinations of these functions with a 
given sum m+n2+n3 = n, we can form the functions 


Pnim = constant x e~2F? 271] egr? +... ten !) Vim(6, $)F( —3n+ 4, 1+-8,0°?), (2) 


where |m| = 0, 1, ..., / and l takes the values 0, 2, ..., n for even n and 1, 3, ..., n for odd n; 
F is the confluent hypergeometric function. This is evident from a comparison of the parities 
(—1)" of the functions (1) and (—1)' of the functions (2), which must be the same. This 
determines the possible values of the orbital angular momentum corresponding to the energy 
levels considered. 

The order of levels of the three-dimensional oscillator is, therefore, with the same notation 
as in Problems 2 and 3, 


(1s), (1p), (1d, 25), (1f, 2p), (Ig, 24, 39), =- 


where the parentheses enclose sets of degenerate statesT 


§34. Resolution of a plane wave 


Let us consider a free particle moving with a given momentum p = kk 
in the positive direction of the z-axis. The wave function of such a particle 
is of the form 

% = constant x et Xz, 


Let us expand this function in terms of the wave functions 4 g m of free motion 
with various angular momenta. Since, in the state considered, the energy 
has the definite value k?/?/2m, it is clear that only functions with this k will 
appear in the required expansion. Moreover, since the function e** has 
axial symmetry about the z-axis, its expansion can contain only functions 
independent of the angle ¢, i.e. functions with m = 0. Thus we must have 


[° a] (ea) 
&kz = } abro = X aR Yio, 
l=0 l=ọ0 


where the a; are constants. Substituting the expressions (28.8) and (33.9) 
for the functions Y;9 and Rx, we obtain 


= r\' 71 d\' sin kr 
gik? — C,P,(cos 8 (=) (-= z = r cosl), 
2. ái ) k r dr kr ( 





where the C, are other constants. These constants are conveniently deter- 
mined by comparing the coefficients of (r cos 6)" in the expansions of the two 
sides of the equation in powers of r. On the right-hand side of the equation 
this term occurs only in the mth summand; for l > n, the expansion of the 
radial function begins at a higher power of r, while for l < n the polynomial 


+ Note that levels with different angular momenta l are mutually degenerate; see the 
footnote at the end of §36. 
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P, (cos 8) contains only lower powers of cos 8. The term in cos? 6 in P,(cos 8) 
has the coefficient (2/)!/2'(/!)2 (see formula (c.1)). Using also formula 
(33.13), we find the desired term of the expansion of the right-hand side of 
the equation to be 

(21)! (kr cos 6)! 


Cle. 
24(18)2 (20-4 1)!! 


On the left-hand side of the equation the corresponding term (in the expansion 
of eth? cos 6) is 

(ikr cos 6)#/I!. 
Equating these two quantities, we find C, = (—i)(2/+1). Thus we finally 
obtain the required expansion: 


etkz _ > (—i)(21+ 1)P,(cos 8) OC =) anal (34.1) 


=p kr 





At large distances this relation takes the asymptotic form 


1 2 
eik? me > (214 1)P,(cos 8) sin(kr—}lz). (34.2) 
kr 


[=O 


In (34.1) the z-axis is in the direction of the wave vector k of the plane 
wave. This expansion can also be written in a more general form which does 
not presur pose a particular choice of the coordinate axes. For this purpose 
we must use the addition theorem for spherical harmonics (see (c.11)) to 
express the polynomials P; (cos 6) in terms of spherical harmonic functions 
of the directions of k and r (the angle between which is 6). The result is 


fo 9) E 
eter = 4n E E ilj(kr)Ym*(kjk)Ym(rjr). (34.3) 
l=0 m=-—; 


The functions j,(kr) (defined by (33.11)) depend only on the product kr, and 
this makes evident the symmetry of the formula with respect to the vectors k 
and r; it does not matter which of the two spherical harmonics is labelled as 
the complex conjugate. 

We normalize the wave function e*z to give a probability current density 
of unity, i.e. so that it corresponds to a flux of particles (parallel to the z-axis) 
with one particle passing through unit area ih unit time. This function is 


p su leetke = y/(m|kħ)etkz, (34.4) 


where v is the velocity of the particles; see (19.7). Multiplying both sides of 
equation (34.1) by /(m/kh) and introducing on the right-hand side the 
normalized functions kimt = Rri* (r) Yim(6, $), we obtain 


2 1 
y= D Vv [+ 1)}— iro ~ hro). 
i=0 ‘ 
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The squared modulus of the coefficient of Y% 17 (or 419+) in this expansion 
determines, according to the usual rules, the probability that a particle in a 
current converging to (or diverging from) the centre has an angular momen- 
tum / (about the origin). Since the wave function v—te** corresponds to a 
current of particles of unit density, this “probability” has the dimensions of 
length squared; it can be conveniently interpreted as the magnitude of the 
“cross-section” (in the xy-plane) on which the particle must fall if its angular 
momentum is /. Denoting this quantity by o, we have 


oy = (21+ 1)/R2. (34.5) 


For large values of 1, the sum of the cross-sections over a range Al of l 
(such that 1 < Al < I) is 


m lh? 
» o1 = —2IAl = 2n——AL 
Al k P 


On substituting the classical expression for the angular momentum, Al = pp 
(where p is what is called the impact parameter), this expression becomes 


2rpåp, 


in agreement with the classical result. This is no accident; we shall see 
below that, for large values of I, the motion is quasi-classical (see §49). 


PROBLEM 


Expand a plane wave in wave functions of states having definite values of the _»y-components m 
of the angular-momentum and p, of the momentum. 

SoLuTion. We take cylindrical polar coordinates y, p, @ with the axis in the y-direction. 
The wave functions of the states in question have the form Q,,(pje’"*e"". If the angle ¢ is 
measured from the z-axis, the expansion can be written as 


a 
eke = eltP cose _ ry Q, (p)e™® 


m= [8] 


(in this case p, = 0), whence 


27 


] 
Qn(P) =z | emrne-me dd = iJ, (kp), 


0 


where J,, (x) is a Bessel function. When kp > 1, we have the asymptotic expression 
` em 2 : l l 
Qai pi #1" [— sin[kp -zn (m —3)]. 
nkp 


§35. Fall of a particle to the centre 


To reveal certain properties of quantum-mechanical motion it is useful to 
examine a case which, it is true, has no direct physical meaning: the motion 
of a particle in a field where the potential energy becomes infinite at some 
point (the origin) according to the law U(r) = —/r®, E > 0; the form of the 


§35 Fall of a particle to the centre 115 


field at large distances from the origin is here immaterial. We have seen in 
§18 that this is a case intermediate between those where there are ordinary 
stationary states and those where a “‘fall” of the particle to the origin takes 
place. 

Near the origin, Schrédinger’s equation in the present case is 


R"+2R' /r+yR/r* = 0, (35.1) _ 


where R(r) is the radial part of the wave function, and we have introduced 
the constant 
y = 2mB/h*?—((14-1) (35.2) 


and have omitted all terms of lower orders in 1/7; the value of the energy E 
is supposed finite, and so the corresponding term in the equation is omitted 
also. 

Let us seek R in the form R~~r*; we then obtain for s the quadratic 
equation 


s(s+1)+y = 0, 


which has the two roots 
& = —3+ VG—y), 5: = —4— v(ł—y). (35.3) 


For further investigations it is convenient to proceed as follows. We draw 
a small region of radius 7, round the origin, and replace the function —y/7? 
in this region by the constant —y/7,?. After determining the wave functions 
in this ‘‘cut off” field, we then examine the result of passing to the limit 
To > 0. 

Let us first suppose that y < }. Then s and s, are real negative quantities, 
and sı > so. For r > 79, the general solution of Schrédinger’s equation 
has the form (always restricting ourselves to small r), 


R = Ar'i+ Br, (35.4) 
A and B being constants. For r < ro, the solution of the equation 
R"+2R'/r+yR/r.? = 
which is finite at the origin has the form 


sin kr 
R=C », k= Vy/r5. (35.5) 
r 





For r = ro, the function R and its derivative R’ must be continuous. It is 
convenient to write one of the conditions as a condition of continuity of the 
logarithmic derivative of rR. This gives the equation 


A(s;+ 1)ro + B(so+ 1)ro* 


Artt Brost? = R cot Ero, 
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or 


A(s,4+ l)ro t B(s24 1)ro" 
— S VY cot Vy. 
Ar + Bra: 


On solving for the ratio B/A, this equation gives an expression of the form 


B/A = constant x r515: (35.6) 


Passing now to the limit rọ > 0, we find that B/A —> 0 (recalling that 
sı > $2). Thus, of the two solutions of Schrédinger’s equation (35.1) which 
diverge at the origin, we must choose that which becomes infinite less rapidly: 


R = Ajr'si (35.7) 
Next, let y > 4. Then s and s, are complex: 
s = —$+iv(y—h), s2 = 5%. 


Repeating the above analysis, we again arrive at equation (35.6), which, on 
substituting the values of s, and s,, gives 


B/A = constant x7iV@7-9. (35.8) 
3 


On passing to the limit r, > 0, this expression does not tend to any definite 
limit, so that a direct passage to the limit is not possible. Using (35.8), the 
general form of the real solution can be written 


R =-constant x771/2 cos( 4/(y—}4) log (7/r0)-+ constant). (35.9) 


This function has a number of zeros which increases without limit as ro 
decreases. Since, on the one hand, the expression (35.9) is valid for the 
wave function (when r is sufficiently small) with any finite value of the energy 
E of the particle, and, on the other hand, the wave function of the normal 
state can have no zeros, we can infer that the “normal state” of a particle in 
the field considered corresponds to the energy E = — œ. In every state of a 
discrete spectrum, however, the particle is mainly in a region of space where 
E > U. Hence, for E -> — œ, the particle is in an infinitely small region 
round the origin, i.e. the particle falls to the centre. 

The ‘“‘critical” field Ug for which the fall of a particle to the centre 
becomes possible corresponds to the value y =}. The smallest value of the 
coefhcient of —1/r? is obtained for / = 0, 1.e. 


Ua = —h2/8mr?. (35.10) 


It is seen from formula (35.3) (for sı) that the permissible solution of Schré- 
dinger’s equation (near the point where U ~ 1 |r?) diverges, as 7 > 0, not 
more rapidly than 1/./7. If the field becomes infinite, as 7 -> 0, more slowly 
than 1/72, we can neglect U(r), in Schrédinger’s equation near the origin, 
in comparison with the other terms, and we obtain the same solutions 
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as for free motion, i.e. Ų ~ r’ (see §33). Finally, if the field becomes infinite 
more rapidly than 1/r? (as —1/r® with s > 2), the wave function near the 
origin is proportional to rt#-! (see §49, Problem). In all these cases the 
product ry tends to zero at r = 0. 

Next, let us investigate the properties of the solutions of Schrodinger’s 
equation in a field which diminishes at large distances according to the law 
U x —B/r?, and has any form at small distances. We first suppose that 
y< }. It is easy to see that in this case only a finite number of negative 
energy levels can exist.t For with energy E = 0 Schrédinger’s equation 
at large distances has the form (35.1), with the general solution (35.4). The 
function (35.4), however, has no zeros (for r # 0); hence all zeros of the 
required radial wave function lie at finite distances from the origin, and their 
number is always finite. In other words, the ordinal number of the level 
E = C which terminates the discrete spectrum is finite. 

If y > 4, on the other hand, the discrete spectrum contains an infinite 
number of negative energy levels. For the wave function of the state with 
E = 0 has, at large distances, the form (35.9), with an infinite number of 
zeros, so that its ordinal number 1s always infinite. 

Finally, let the field be U = —§/r* in all space. Then, for y > }, the 
particle falls, but if y < } there are no negative energy levels. For the 
wave function of the state with E = 0 is of the form (35.7) in all space; it has 
no zeros at finite distances, i.e. it corresponds to the lowest energy level (for 
the given /). 


§36. Motion in a Coulomb field (spherical polar coordinates) 


A very important case of motion in a centrally symmetric field is that of 
motion in a Coulomb field 


U = +a/r 
(where « is a positive constant). We shall first consider a Coulomb attraction, 
and shall therefore write U = —a/r. It is evident from general considera- 


tions that the spectrum of negative eigenvalues of the energy will be discrete 
(with an infinite number of levels), while that of the positive eigenvalues will 
be continuous. - 

Equation (32.8) for the radial functions has the form 


dR 2dR KI+1) 


dr? r dr r? 
If we are concerned with the relative motion of two attracting particles, m 
must be taken as the reduced mass. 
In calculations connected with the Coulomb field it is convenient to use, 
instead of the ordinary units, special units for the measurement of all quanti- 
ties, which we shall call Coulomb units. As the units of measurement of 
mass, length and time, we take respectively 





2m ed 
R+—(E+")R = 0). (36.1) 
h2 r 


my, h2/ma, F8 fra. 


t It is assumed that for small y the field is such that the particle does not fall. 
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All the remaining units are derived from these; thus the unit of energy is 
ma? hi, 
From now on, in this section and the following one, we shall always (unless 
explicitly stated otherwise) use these units.t 
Equation (36.1) in the new units is 


dR 2dR KI) 


dr? + dr r? 





1 
R+2(E+-)R = 0, (36.2) 
Y 


DISCRETE SPECTRUM 


Instead of the parameter E and the variable r, we introduce the new 
quantities 


n=1//({—2E), p = 2rjn. (36.3) 


For negative energies, n 1s a real positive number. The equation (36.2), on 
making the substitutions (36.3), becomes 





2 n I(I[+1) 

R” +-R'+ | -1+-— . i4 = 0 (36.4) 
P P P 

(the primes denote differentiation with respect to p). 

For small p, the solution which satisfies the necessary conditions of finite- 
ness is proportional to p! (see (32.15)). To calculate the asymptotic be- 
haviour of R for large p, we omit from (36.4) the terms in 1/p and 1/p? and 
obtain the equation 

RY = yR, 


whence R = ett, The solution in which we are interested, which vanishes 
at infinity, consequently behaves 71. e—? for large p. 
It is therefore natural to make the substitution 


R = ple-e/2z(p), (36.5) 
when equation (36.4) becomes 


pw" +-(214-2—p)i' +(n—I—1)w = 0. (36.6) 


+ If m = 9-11 x10-?! g :s the mass of the electron, and x = e? (where e is the charge on 
the electron), the Coulomb units are the same as what are called atomic umts. The atomic 
unit of length is 

Alme: = 0-529 x 10-8 cm 
(what is called the Bohr radius). The atomic unit of energy is 
metih? = 4-36 x10-!! erg = 27-21 electron-volts. 


A quantity equal to one half of this umit ts called a rydberg. The atomic unit of charge ts 
e = 4-80 x10-?!? esu. We formally obtain the formulae in atomic units by putting e = m = 
h = 1. Forz = Ze? the Coulomb and atomic units are not the same. 
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The solution of this equation must diverge at infinity not more rapidly than 
every finite power of p, while for p = 0 it must be finite. The solution which 
satisfies the latter condition is the confluent hypergeometric function 


w= FER A p) (36.7) 


(see §d of the Mathematical Appendices)}.t A solution which satisfies the 
condition at infinity is obtained only for negative integral (or zero) values of 
—n+/+1, when the function (36.7) reduces to a polynomial of degree 
n—I—1. Otherwise it diverges at infinity as e? (see (d.14)). 

Thus we reach the conclusion that the number n must be a positive integer, 
and for a given / we must have 


n> 141, (36.8) 
Recalling the definition (36.3) of the parameter n, we find 
Bie S10 on ale (36.9) 


This solves the problem of determining the energy levels of the discrete 
spectrum in a Coulomb field. We see that there are an infinite number of 
levels between the normal level E = —} and zero. The distances between 
successive levels diminish as 7 increases; the levels become more crowded 
as we approach the value E = 0, where the discrete spectrum closes up into 
the continuous spectrum. In ordinary units, formula (36.9) ist 


E = —m2x?/2h?n’. (36.10) 


The integer 7 is called the principal quantum number. The radial quantum 


number defined in §32 is 


nr, = n—I—l. 
For a given value of the principal quantum number, l can take the values 
Rr. va (36.11) 


i.e. n different values in all. Only z appears in the expression (36.9) for the 
energy. Hence all states with different / but the same n have the same energy. 
Thus each eigenvalue is degenerate, not only with respect to the magnetic 
quantum number m (as in any motion in a centrally symmetric field) but 
also with respect to the number /. This latter degeneracy (called accidental 
or Coulomb degeneracy) is a specific property of the Coulomb field. To each 
value of / there correspond 2/+1 different values of m. Hence the degree of 
degeneracy of the mth energy level is 


na—1 
(241) = m. (36.12) 
=0 





t The second solution of equation (36.6) diverges as p~? -3 as p > 0. 


t Formula (36.10) was first derived by N. Bohr in 1913, before the discovery of quantum 
mechanics. In quantum mechanics it was derived by W. Pauli in 1926 using the matrix 
method, and a few months later by E. Schrödinger using the wave equation. 
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The wave functions of the stationary states are determined by formulae 
(36.5) and (36.7). The confluent hypergeometric functions with both 
parameters integral are the same, apart from a factor, as what are called the 
generalized Laguerre polynomials (see §d of the Mathematical Appendices). 
Hence 


Rj = constant x pe~’ arp). 
The radial functions must be normalized by the condition 
i Rair dr = 1 
0 
Their final form ist 
Mie Be. BY Safir 
rin( =) Lati (= 
aE n n 
2 (n-+1)! 


hie ee 


(36.13) 

the normalization integral is calculated by (f.6).] 

Near the origin, R, has the form 
ol+1 I 
Razr aired GOL eae) (36.14) 
n2+l(214-1)! N (n—I—1)! 
At large distances, 
Á m 

Rn x (1 erre, (36.15) 


arth y[(n+i)!(n—i—1)!] 


The wave function R,, of the normal state decreases exponentially at distances 
of the order r ~ 1, i.e. 7 ~ h?/mzx in ordinary units. 


+ We give the first few functions Ry: explicitly: 


Rio = 207, 
Roy = (1 v 2)je™ (1-4), 
Ra = (12 O)e er, 


2303 s{ 1 í : Ji 
= r — +—r- 
Ræ = ( je- ( 3 5 


Ra = (8/27 vojerr(1=z), 
Rap = (4/81 v30)e7 3r. 


t The normalization integral can also be calculated by substituting the expression (d.13) 
fos the Laguerre polynomials and integrating by parts (similarly to the calculation of the 
integral (c.8) for the Legendre polynomials). 
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The mean values of the various powers of r are calculated from the formula 


a 
rk = Í rĚt2R 2 dr. 
0 


The general formula for 7* can be obtained by means of formula (f.7). Here 
we shall give the first few values of yk (for positive and negative k): 


7 = 4[3n®—K(141)], 7? = 4n%{5n?+1—31(1-+1)], 
E iis (36.16) 
ri = ln, r = 1/n(144). 


CONTINUOUS SPECTRUM 


The spectrum of positive eigenvalues of the energy is continuous and 
extends from zero to infinity. Each of these eigenvalues is infinitely degener- 
ate; to each value of E there corresponds an infinite number of states, with 
l taking all integral values from 0 to œ (and with all possible values of m for 
the given /). 

The number n and the variable p, defined by the formulae (36.3), are now 
purely imaginary : 


n = —if ¥(2E) = —ijk, p = 2ikr, (36.17) 


where k = 4/(2E).t The radial eigenfunctions of the continuous spectrum 
are of the form 


Cri 


Sa (2kr)le-**" F(i/k+-14-1, 214-2, 2ikr), (36.18) 





ki 


where the Cy; are normalization factors. They can be represented as a 
complex integral (see §d): 








1 —i /k—i-1 
Re = Cri 2kr lo—~ikr ef{ 1— ¿-2-2 dé, 
) 
2ni 


2ikr 
é (36.19) 


which is taken along the contour{ shown in Fig. 10. The substitution 
E = 2ikr(t+4) converts this integral to the more symmetrical form 





+ It would be possible to define n and p by the complex conjugate expressions n = tjk, 
p = —2ikr; the real functions Rx: do not, of course, depend on which definition is used. 

t Instead of this contour we could use any closed loop passing round the singular points 
€=0 and £& = 2ikr in the positive direction: For integral l, the function V(é) = 
E-"-'(£—2rkr)2—-! (see §d) returns to its initial value on Passing round such a contour. 
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(—2kr) 1 
Rr 2 rar a $ ertkrt(y +4) [K-I-1(¢__})-t /k-—l-1 dt : (36.20) 
T 


the path of integration passes in the positive direction round the points 
t = +}. Itis seen at once from this representation that the functions R,, 
are real. 

The asymptotic expansion (d.14) of the confluent hypergeometric function 
enables us to obtain immediately a similar expansion for the wave functions 
R,,. The two terms in (d.14) give two complex conjugate expressions in the 
function R,,, and as a result we obtain 


e7" [2k etlkr—nQ0+1)/2+(1 /k) log 2kr] | 

PNG co — ae ea) a 

seein Mas Gn a) Sh l sia ) 
(36.21) 


If we normalize the wave functions on the “*k/27scale”’ (i.e. by the condition 
(33.4)), the normalization coefficient is 


Cpr = 2her/2k|T(141—i/k)}. (36.22) 


For the asymptotic expression for R, when r is large (the first term of the 
expansion (36.21)) is then of the form 


Ra & Zin (+ + ; log 2hr — in +81), 
7 


(36.23) 
ô = arg F(1+1— ijk), 


in agreement with the general form (33.20) of the normalized wave functions 
of the continuous spectrum in a centrally symmetric field. The expression 
(36.23) differs from (33.20) by the presence of a logarithmic term in the argu- 
ment of the sine; however, since log r increases only slowly compared with 7 
itself, the presence of this term is immaterial in calculating a normalization 
integral which diverges at infinity. l 

The modulus of the gamma function which appears in the expression 
(36.22) for the normalization factor can be expressed in terms of elementary 
functions. Using the familiar properties of gamma functions: 


(241) = 20 (2), P(e) PU—2) = risinn, 
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we have 

D(l4+-1+472/k) = (I+i/k) ... (1+7/R)(i/k)T(a/k), 

EEEE 0s. USO ae, 


and also 
|[DQ+1—i/k)] = [P01 ik) (I+ 14 4/)p 
-= FETI (rF) aint, 
r Er k? k 
Thus 


Cy = as 5 1 Į] A (#45): (36.24) 


for 1 = 0 the product is replaced by unity. 
The radial function for the special case of zero energy can be obtained by 
taking the limit k > 0, for which 


F(j+041 21 +-2, 2ikr ) ari 2l +-2, 2ikr ) 


sZ pa 
(HE! (AINA 4-3) 2! 


= (21+ 1)! (2r) -1-1/2 Jor (v (8r)), 


where Jər+ı is a Bessel function. The coefficients Cy; (36.24) for k > 0 
become 


Cua = a/ (87r )k-i+1/2, 
Hence 


[Real V k]e-0 = V(47/r) Jer sa(V/(87))- (36.25) 
The asymptotic form of this function for large r ist 
[Rri Vk]k-0 = (8/r3)1/4 sin (/(8r) — ir — $r). (36.26) 


The factor yk disappears if we change to normalization on the energy scale, 


i.e. from the functions Rx; to Re: given by (33.5); the latter remains finite as 
E +0. 


t It may be noted that this function corresponds to the quasi-classical approximation (§49) 
applied to motion in the region (/+ 4)? <r < k-?. 
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In a repulsive Coulomb field (U = afr) there is only a continuous spectrum 
of positive eigenvalues of the energy. Schrédinger’s equation in this field 
can be formally obtained from the equation for an attractive field by changing 
the sign of r. Hence the wave functions of the stationary states are found 
immediately from (36.18) by the same alteration. The normalization co- 
efficient is again determined from the asymptotic expression, and as a result 
we obtain 


Cki 


e T e 
Greeny 


2kr\leik" F(i/k-+-14+-1, 214-2, —2ikr), 
( 


Cra = 2 ke-*/2k|P(L4-1+43/R)| 


8a p 
= ee / (s +7). (36.27) 


=l 
The asymptotic expression for this function for large r is 


2 1 
Ries a e- eee y ai) 
äi > sin(# pe ae (36.28) 


ô = arg [(1+1+i/k). 


THE NATURE OF THE COULOMB DEGENERACY 


In classical motion of a particle in a Coulomb field, there is a conservation 
law peculiar to this type of field; if the field is an attractive one, 


A = r/r—p x1 = constant (36.29) 
(see Mechanics, §15). In quantum mechanics, the corresponding operator is 

A = r/r—4(pxi-1xp), (36.30) 
and is easily seen to commute with the Hamiltonian H = }p?—1/r. 


Direct calculation gives the following commutation rules for the operators 
A; with one another and with the angular momentum operators: 


(i, Ax} = ianÂn {A;, Ax} = —2iHeiuhh. (36.31) 


The non-commutativity of the A; means that the quantities Az, Ay, Az 
cannot simultaneously have definite values in quantum mechanics. Any one 
of the operators, say 42, commutes with the corresponding angular momen- 
tum component /,, but not with the squared angular momentum operator 12. 
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The existence of a further conserved quantity, which cannot be measured 
simultaneously with the others, leads (see §10) to an additional degeneracy 
of the levels, and this is the “‘accidental’’ degeneracy of the discrete energy 
levels, peculiar to the Coulomb field. 

The origin of this degeneracy may also be formulated in terms of the 
increased symmetry of the Coulomb problem in quantum mechanics, in 
comparison with the symmetry relative to spatial rotations (V. A. Fok 1935). 
For this purpose we note that, in discrete-spectrum states with a fixed 
negative energy, we can replace H on the right of the second equation (36.31) 
by E, and use instead of the A; the operators % = A;/V/(—2E). The com- 
mutation rules for these operators are 


(hi, tix} = text, {âi Be} = ieil. (36.32) 


These, together with the rule Li, lx} = oem: are formally identical with 
the commutation rules for the operators of infinitesimal rotations in four- 
dimensional Euclidean space.t This is the symmetry of the Coulomb 
problem in quantum mechanics. 

From the commutation rules (36.32) we can again derive an expression for 
the energy levels in a Coulomb field.|| They can be rewritten by using 
instead of Î and ù the operators 


hh = 4(0+u0), fe = 3-4). (36.33) 
For these, 
Sire fix} = tere, {fois fex} = tecxtjan {fii fox} = 0. (36.34) 


These are formally identical with the commutation rules for two independent 
three-dimensional angular momentum vectors. ‘The eigenvalues of j1? and 
j2? are therefore j1(j1+ 1) and jo(je+ 1), where ji, j2 = 0, $, 1, 3, .... tt On the 
other hand, the definition of the operators û and I = r xp shows by a simple 
calculation that 





t Here lz, ly, lz represent the operators of infinitesimal rotations in the yz, zx and xy planes 
in four-dimensional Cartesian coordinates x, y, z, u; tz, ty, Uz are the operators of infinitesimal 
rotations in the xu, yu and zu planes. 

t The symmetry appears explicitly in the wave functions in the momentum representation: 
see V. A. Fok, Zeitschrift für Physik 98, 145, 1935. 

|| ‘This derivation is essentially as given by W. Pauli (1926). 

tt Here we anticipate the properties of the angular momentum that are to be described in 
§54 (the possibility of integral and half-integral 7). 
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1 


Pat = =] a 
2E 


with H again being replaced by E in calculating 12442. Hence 


(where j = ji = j2), and then E = —1/2(27+1)*. With the notation 


DE we aL (36.35) 


we get the required result E = 1/2n?. The degree of degeneracy of the levels 
is (2j1 + 1)(2j2+ 1) = (2J +1)? = n?, as it should be. Lastly, since Î = Jı +Je, 
for a given value of jı = jg = $(m—1) the orbital angular momentum / takes 
values from 0 to 2j = n—1.T 


PROBLEMS 


PROBLEM 1. Determine the probability distribution of various values of the momentum 
in the ground state of the hydrogen atom. 


SoLuTION.{ The wave function of the ground state is y = RioYoo = (1/\/7)e*. The 
wave function of this state in the p representation is then given by the integral 


a(p) = | o(rje mt dV 


(see (15.10)). The integral is calculated by changing to spherical polar coordinates with the 
polar axis along p; the result is 


+ The “accidental” degeneracy of levels with different values of / occurs also for motion 
in a centrally symmetric field U = }mw?*r? (a three-dimensional oscillator; see §33, Problem 
4). This degeneracy is likewise due to the extra symmetry of the Hamiltonian. In this case, 
the symmetry arises because in A = §2/2m+4mw*r? both the operators f: and the coordi- 
nates xı occur as sums of squares. If they are replaced by the operators 


> _ Mux; + 1p 

' A(2mhw) 

a Mite — 1p; 
üi = e, 
a/(2mħw) 


we obtain 
H = hof[â +. â +3). 


This is invariant under any unitary transformations of the operators 4; * and 4 forming a 
group that is wider than that of the three-dimensional rotations (under which the particle 
Hamiltonian is invariant in any centrally symmetric field). — A 

The specific property of the Coulomb and oscillator fields in quantum mechanics (presence 
of accidental degeneracy) is in correspondence with the fact that in classical mechanics closed 
particle trajectories exist in these (and only these) fields. 

t In Problems 1 and 2, atomic units are used. 
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= 84/7 
(1p 


` 


a(p) 


and the probability density in p-space is |a(p)|?#/(27)*. 
PROBLEM 2. Determine the mean potential of the field created by the nucleus and the 
electron in the ground state of the hydrogen atom. 


SoLuTION. The mean potential $e created by an “electron cloud” at an arbitrary point r 
is most simply found as the spherically symmetric solution of Poisson’s equation with charge 
density p = —|4}?: 


nba = 4e-2r 
Y ane) i 


Integrating this equation, and choosing the constants so that ¢-(0) is finite and ¢.(00) = 0, 
and adding the potential of the field of the nucleus, we obtain 


1 1 
=-+¢7) = (- +ijew, 
| r 


For r & 1 we have ¢ = 1/r (the field of the nucleus), and for r > 1 the potential ¢ = e`% 
(the nucleus is screened by the electron). 


U(r) 


am | 


Fic. 11 


ProsLem 3. Determine the energy levels of a particle moving in a centrally symmetric 
field with potential energy U = A/r?—B/r (Fig. 11). 

So.uTION. The spectrum of positive energy levels is continuous, while that of negative 
levels is discrete; we shall consider the latter. Schrédinger’s equation for the radial function is 


dR 2dR 2m 7 H p A B R=0 a) 
Rr a Te = = 
We introduce the new variable 
p =2V(—2mE)rih, 
and the notation 
2mA/h?+i(l+1) = s(s+1), (2) 
By (m!/—2E):h = n. (3) 


Then equation (1) takes the form 


2 1 1 
PETETA CENON 
P PP ka 
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which is formally identical with (36.4). Hence we can at once conclude that the solution 
satisfying the necessary conditions is 


R= pie? F(—n-bs+ I; 2s+2, P) 


where n—s—1 = p must be a positive integer (or zero), and s must be taken as the positive 
root of equation (2). From the definition (3) we consequently obtain the energy levels 


9 


2B*m 


ee 
P h2 





[20+ 1+ v{( 21+ 1)°-+8mA/h?}\-* 


PROBLEM 4. The same as Problem 3, but with U = Ajr? + Br’ (Fig. 12). 
SoLuTION. There is only a discrete spectrum. Schrédinger’s equation is 


dR 2dR amr (l+) A` 
= =| E-— |r = 0. 
d? xr dr FF 2mr2 r 
Introducing the variable 
E = V/(2mB)rjh 
u(r) 
r 
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and the notation 
K(l-++-1)-+2mA jf? = 2s(2s+ 1), 


V(2m/B)E/h = 4(n-+s)-+3, 


we obtain the equation 
3 
ER’ + OR a a Dek = 0. 


The solution required behaves asymptotically as ef when ¢—> œ, while for small ¢ it is 
proportiona) to £’, where s must be taken as the positive quantity 


s = P—14 V{(2l+1)?-++8mA/h4}]. 
Rence we seek a solution in the form 
R = epu, 
obtaining for w the equation 


3 
Ew + (245—¢ )u'-+nw = 0, 
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whence 
3 
w= F(—n, aS. e), 


where n must be a non-negative integer. We consequently find as the energy levels the infinite 
aet of equidistant values 


= å v(B/2m[4n+2+ v{(21+ 1)?-+8mA /h?}], n = 0,1,2, ... 


£37. Motion in a Coulomb field (parabolic coordinates) 


- The separation of the variables in Schrédinger’s equation written in 
spherical polar coordinates is always possible for motion in any centrally 
symmetric field. In the case of a Coulomb field, the separation of the variables 
is also possible in what are called parabolic coordinates. ‘The solution of the 
problem of motion in a Coulomb field in terms of parabolic coordinates is 
useful in investigating a number of problems where a certain direction in 
space is distinctive; for example, for an atom in an external electric field 
(see §77). 
The parabolic coordinates €, n, ¢ are defined by the formulae 


wim vepri yp = Vean, = = Am, : (37.1) 
r = V(x*+y?+22) = (E+), 
or conversely 
E=r+z, 7 =r—z, ¢ = tan (y/x); (37.2) 


€ and 7 take values from 0 to œ, and ¢ from 0 to 27. The surfaces = 
constant and 7 = constant are paraboloids of revolution about the z-axis, 
with focus at the origin. This system of coordinates is orthogonal. The 
element of length is given by the 


(di)? = Elan T L (37.3) 
and the element of volume is 
dV = {(ġ+n)dédndġ. (37.4) 
From (37.3) we have the ow operator 
bec 
2 aÉ a aA he age ji 


Schrödinger’s equation for a particle in an attractive Coulomb field with 
U = —1jr = —2/(€+7) is 
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E) a =) |+ te shte(Ete—)e=0. 6876 
ree EE mage N TEF 


Let us seek the eigenfunctions W in the form 
b = fie) flne, (37.7) 


where m is the magnetic quantum number. Substituting this expression in 
equation (37.6) multiplied by 1(€+7), and separating the variables & and 7, 
we obtain for fı and fz the equations 


- (E ERE en = 0, 
(37.8) 


(r Buhe ton aS, 


where the separation parameters B;, Be are related by 


B,\ +B. = 1. (37.9) 


Let us consider the discrete energy spectrum (E < 0). We introduce in 
place of E, é, 7 the quantities 


nS EBB = EEE) =e, Salm, WA 
whereupon we obtain the equation for fi 


d? i d 1 /|m\+1 m? 
LA CoN +[-i4 1+—(' | +m- |A =i. (37.11) 
Pi 2 4p,° 


dp,? a dp, 








and a similar equation for fz, with the notation 
m = —4H(\m|+1)+n8,, n = —4(|m|+1)+n8,. (37.12) 


Similarly to the calculation for equation (36.4), we find that f, behaves as 
e—P: for large p, and as p,*'”! for small p,. Accordingly, we seek a solution 
of equation (37.11) in the form 


Siler) = eap wp), 
and similarly for fe, obtaining for w the equation 
pw Him +p tn = 0. 


This is again the equation for a confluent hypergeometric function. The 
solution satisfying the conditions of finiteness 1s 


w, = F(—n, |m|+1, pi), 


where nı must be a non-negative integer. 
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Thus each stationary state of the discrete spectrum is determined in para- 
bolic coordinates by three integers: the parabolic quantum numbers nı and 
no, and the magnetic quantum number m. For n, the principal quantum 
number, we have from (37.9) and (37.12) 


n = m+n+|m +1. (37.13) 


For the energy levels, of course, we obtain our previous result (36.9). 

For given n, the number |m| can take n different values from 0 to n—1. 
For fixed n and |m| the number n takes n— |m] values, from 0 to n—|m|—1. 
Taking into account also that for given |m| we can choose the functions with 
m = + |m|, we find that for a given n there are altogether 


2" (n—m)-+(n—0) =n 
mæl 
different states, in agreement with the result obtained in §36. 


The wave functions Yn nim Of the discrete spectrum must be normalized 
by the condition 


[iena dV =} TE Wye, nle d¢dédy = 1. (37.14) 


000 
The normalized functions are 
y2 g n\ em? 
nnm `~“; Jn miT nimt Io 37.15 
ane n? fr, G) G) v (27) ar) 
where 
1 /(p+|ml)! 
Jomp) = aT ee |m| +1, pje 2p, (37.16) 


The wave functions in parabolic coordinates, unlike those in spherical 
polar coordinates, are not symmetrical about the plane z = 0. For nı > n 
the probability of finding the particle in the direction z > 0 is greater than 
that for z < 0, and vice versa for nı < no. 

To the continuous spectrum (E > 0) there corresponds a continuous spec- 
trum of real values of the parameters fj, £a in equations (37.8) (connected as 
before, of course, by the relation (37.9)). We shall not pause to write out here 
the corresponding wave functions, since it is not usually necessary to employ 
them. Equations (37.8), regarded as equations for the “eigenvalues” of the 
quantities £1, 82, have also (for E > 0) a spectrum of complex values. The 
corresponding wave functions are written out in §135, where we shall use 
them to solve a problem of scattering in a Coulomb field. 

The existence of stationary states |7122m)> leads to an additional conser- 
vation law (36.29). In these states, the quantities 7, = m and 4+», as well as 
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the energy, have definite values. Calculating the diagonal matrix elements of 
the operator Az, we find that 


Az = (my —ng)jn. (37.17) 


Here uz = nı — nə, and the components of the “angular momenta” jı and je 
are 


jiz = 3(m+n,—ne) = p, 


(37.18) 


joz = (m-n +n) = pe. 


These properties of the states |z,n2m) (or, equivalently, |nuju2>) make it 
easy to establish the relation between their wave functions and those of the 
states |n/m>. Since 1 = jı + Je, the change from one of these descriptions to 
the other is essentially the construction of wave functions with addition of 
two angular momenta, discussed in §106. In terms of the ‘“‘angular momenta” 
ji and jo, the states |zim> and |nyn2m) are described as | Jyjalm> and | frjopipe, 
where, from (36.35) and (37.13), 


ji = je = (2-1) S41 Fhe + |). (37.19) 
According to the general formulae (106.9)-(106.11), 


dnim = > Cim| uiue) pnuu, 


n, tum 
(37.20) 


n-1 


Pnu n, om > Cl, py + poluipo>vnim 
1=0 


(D. Park 1960). 


CHAPTER VI 


PERTURBATION THEORY 


§38. Perturbations independent of time 


THE exact solution of Schrédinger’s equation can be found only in a com- 
paratively small number of the simplest cases. ‘The majority of problems 
in quantum mechanics lead to equations which are too complex to be solved 
exactly. Often, however, quantities of different orders of magnitude appear 
in the conditions of the problem; among them there may be small quantities 
such that, when they are neglected, the problem is so much simplified that its 
exact solution becomes possible. In such cases, the first step in solving the 
physical problem concerned is to solve exactly the simplified problem, and 
the second step is to calculate approximately the errors due to the small terms 
that have been neglected in the simplified problem. There is a general 
method of calculating these errors; it is called perturbation theory. 

Let us suppose that the Hamiltonian of a given physical system is of the 
form 


H =H +P, 


where P is a small correction (or perturbation) to the unperturbed operator Hy. 
In §§38, 39 we shall consider perturbations V which do not depend explicitly 
on time (the same is assumed regarding Ĥ, also). The conditions which are 
necessary for it to be permissible to regard the operator V as “small” com- 
pared with the operator Ĥ will be derived below. 

The problem of perturbation theory for a discrete spectrum can be formu- 
lated as follows. It is assumed that the eigenfunctions ,„‘® and eigenvalues 
E,,© of the discrete spectrum of the unperturbed operator A, are known, i.e. 
the exact solutions of the equation 


Ayo = Eso) (38.1) 
are known. It is desired to find approximate solutions of the equation 
Ay = (A+ Dy = By, (38.2) 


i.e. approximate expressions for the eigenfunctions y,, and eigenvalues E, of 
the perturbed operator H. 
In this section we shall assume that no eigenvalue of the operator A, is 


degenerate. Moreover, to simplify our results, we shall at first suppose that 
there is only a discrete spectrum of energy levels. 


The calculations are conveniently performed in matrix form throughout. 
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To do this, we expand the required tunction ¢% in terms of the functions 


pn: 
P= Lenn. (38.3) 


Substituting this expansion in (38.2) we obtain 


Dom m+ VY) = em E fen; 


multiplying both sides of this equation by 4,™ and integrating, we find 
(E—E, Je, = 2 Vinta (38.4) 


Here we have introduced the matrix Vym of the perturbation operator P, 
defined with respect to the unperturbed functions Yp: 


Vix, = f prO Pb 0 dg. (38.5) 


We shall seek the values of the coefficients c,, and the energy E in the form 
of series 


E = EOLEOL EOL p Cm = CeO ben +e + ..., 


where the quantities E@) and c,,@) are of the same order of smallness as the 
perturbation P, the quantities E®) and c„® are of the second order of small- 
ness, and so on. 

Let us determine the corrections to the nth eigenvalue and eigenfunction, 
putting accordingly c, = 1, ¢,, = O form # n. To find the first approxi- 
mation, we substitute in equation (38.4) E = E+E, cp =¢,+4e,), 
and retain only terms of the first order. The equation with k = n gives 


E,” = Vin = EA Py dg. (38.6) 


Thus the first-order correction to the eigenvalue E,, is equal to the mean 
value of the perturbation in the state y,, ©). 
The equation (38.4) with k # n gives 


6) = Vien|(E,O—-E,) for k Æ n, (38.7) 


while ¢,,@) remains arbitrary; it must be chosen so that the function ¥, = 
Pn + Yn) is normalized up to and including terms of the first order. For 
this we must put c,{) = 0. For the functions 


E,O—E_© 


YC = 2 Em © (38.8) 


(the prime means that the term with m = n is omitted from the sum) are 
orthogonal to y,,, and hence the integral of |x, +2,,% |? differs from unity 
only by a quantity of the second order of smallness. 
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Formula (38.8) determines the correction to the wave functions in the first 
approximation. Incidentally, we see from this formula the condition for the 
applicability of the above method. This condition is that the inequality 


U sent < | EO — E m| (38.9) 


must hold, i.e. the matrix elements of the perturbation must be small com- 
pared with the corresponding differences between the unperturbed energy 
levels. 

Next, let us determine the correction to the eigenvalue E,,‘® in the second 
approximation. To do this, we substitute in (38.4) E = E+E, +E, ®, 
Cy = Ck HeD +e”, and examine the terms of the second order of small- 
ness. The equation with k = n gives 


EP = E Vane 
whence 


Wum 
(2) — naa TE 
E,@ = > <x (38.10) 


(we have substituted cm{® from (38.7), and used the fact that, since the 
operator P is Hermitian, Vinn = Vnm”*). 

We notice that the correction in the second approximation to the energy 
' of the normal state is always negative; for, since E,,) then corresponds to 
the lowest vaiue of the energy, all the terms in the sum (38.10) are negative. 

The further approximations can be calculated in a similar manner. 

The results obtained can be generalized at once to the case where the 
operator H, has also a continuous spectrum (but the perturbation is applied, 
as before, to a state of the discrete spectrum). To do so, we need only add to 
the sums over the discrete spectrum the corresponding integrals over the 
continuous spectrum. We shall distinguish the various states of the continu- 
ous spectrum by the suffix v, which takes a continuous range of values; by v 
we conventionally understand an assembly of values of quantities sufficient 
for a complete description of the state (if the states of the continuous spec- 
trum are degenerate, which is almost always the case, the value of the energy 


alone does not suffice to determine the state).+ Then, for instance, we must 
write instead of (38.8) 


d 


p, = > Laon Oy. a ag W dy (38.11) 
b m E,O—E,,© $ E,O—E, : i 


and similarly for the other formulae. 

It is useful to note also the formula for the perturbed value of the matrix 
element of a physical quantity f, calculated as far as terms of the first order 
by using the functions yn = yn + yD, with ¥n given by (38.8). The 


t Here the wave functions ¥,( must be normalized by delta functions of the quantities v. 
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following expression 1s easily obtained: 


% ae i V MIn (0) 
fam = fam + >: oe > a (38.12) 
R 


0 
„O— EF, OE, 


In the first sum k # n, while in the second k + m. 


PROBLEMS 


PROBLEM 1. Determine the correction Yẹ‘? in the second approximation to the eigen- 
functions. 


SOLUTION. The coefficients cx‘?! (k 3 n) are calculated from equations (38.4) with k + n, 
written out up to terms of the second order, and the coefficient c,,?) is chosen so that the 


function Yn = gn) + pn + dn is normalized up to terms of the second order. As a result 
we find 


Bwn 


tn s oe on 5 a a, Ohy D Ta 
m R m = 


where we have introduced the frequencies 
= (E,O— Eh 
PROBLEM 2. Determine the correction in the third approximation to the eigenvalues of the 


energy 


SOLUTION. Writing out the terms of the third order of smallness in equation (38.4) with 
k = n, we obtain 


r 





ea y Malate y, y Hat 
m mn 


T 
PROBLEM 3. Determine the energy levels of an anharmonic linear oscillator whose Hamil- 
tonian is 
H = 3 p2/m+ mew? + ax? + Bx, 


SoLuTION. The matrix elements of x? and x‘ can be obtained directly according to the 
rule of matrix multiplication, using the expression (23.4) for the matrix elements of x. We 
find for the matrix elements of x? that are not zero 


(Saan = (Jang = (A/mw)s*4/[4n(n—1)(n—2)], 

in = Oa = (A/ma))§ !?4/(9n*/8). 
The diagonal elements in this matrix vanish, so that the correction in the first approximation 
due to the term ax? in the Hamiltonian (regarded as a perturbation of the harmonic oscillator) 
is zero. The correction in the second approximation due to this term is of the same order as 
that in the first approximation due to the term fx‘. The diagonal matrix elements of x* are 


(A), nw = (Ai mw)* . 3(2n?+2n+1). 


Using the general formulae (38.6) and (38.10), we find the following approximate expression 
for the energy levels of the anharmonic oscillator: 


E, = hapt —) (etat) +5 a (n?-+-n4-2). 
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PROBLEM 4. A spherical potential well with infinitely high walls is subjected to a small 
deformation (without change of volume) which gives it the form of a slightly prolate or 
oblate spheroid with semi-axes a = bandc. Find the splitting of the energy levels of a particle 
in the deformed well (A. B. Migdal 1959). 


SOLUTION. The equation of the well boundary is 


2 2 a2 

x+ “a 

ee e, 
a? c 


and by the change of variables x > ax/R, y >ay|R, z >cz|R it is converted into 
x+y? +2z? = R?, the equation of a sphere with radius R. The same change of variables 
converts the Hamiltonian of the particle, A = p3/2M = —h*A/2M (where M is the mass of 
the particle and the énergy is measured from the bottom of the well) into AH = A,+P, where 


Ao = —h2A/2M, 


he [ {Re 0? g2 R2 g? 
ps: eee te Co pe 1 et. 
2M L\ a? Ox? y? ps Oz? 
Thus the problem of motion in an ellipsoidal well reduces to that of motion in a spherical 


well. If the ellipsoid is almost a sphere of radius R = (a?c)'/*, V may be regarded as a small 
perturbation. If the ellipsoidality £ (S| < 1) is defined by 


az R(1—4£), c = R(1+32), 
the perturbation operator may be written 


P = (p/3M\p? ~ 3,2). 


In the first order of perturbation theory, the change in the energy levels of the particle from 
their values in the spherical well is 


AEnim = Enim — En ® 
= (nlm|V|nim}, 


where / and m are the angular momentum of the particle and its component along the axis of 
the spheroid; n numbers the levels in the spherical well for a given /, which are independent 
of m. Since p*—3p;,? is the zz-component of an irreducible tensor, ĉip? —3pipk, with zero 
trace, we find from (107.2) and (107.6) that the matrix element <nlm|V|nlm> is proportional to 


a= : o 


and therefore 


3m2 
KI+1) 





<nlm|V|nlm> = (1 — ) KOLV lO. 


A table of 3j-symbols is given in §106. 
Next, 


h2 a2 
eee (0 2 T 
(nl0|V|nl0) = 3BEm + B+ <nl0 <'nl0} 


afi !2 
= 2BF,)(0) — p” Hm r2 dr do, 
z 


in the first term we have used Schrédinger’s equation É, Ynim = Ep ® Ynim for a spherical well, 
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and in the second term integrsted by parts. With Yz in the form (28.11), we find the deriva- 
tive of nio = Rai(r) Yi0( 6,4) to be 





ĉ a a 
z fno = (cos 6— = Sdn 


i(l-+1) E E 
= Be U E R i 
a ai i) =a 


al 


. d+) ; 
+ [4121p (Ra a ZiR)? l-1, © 


The radial integrals are calculated by means of the formulae 


Í RniRni r dr = — } f Ry” dr, 
0 


o 


[ Rare ar = Me, yo N+ 1) | Raai 
Oo 


0 
which are derived by integrating by parts and using the radial Schrédinger’s equation (33.3) 


2M 


2 Kit NR. = -Z Em, 
i 


Rn” + —Rar - 
r ve 





The terms containing integrals of Rn? cancel, and the final result is 


AEnin = yy yee che us ~ 3 en 
(27—1)(2l+3) Lae) 3 





Note that 


1 l 
91a l > Enim = En, 


ma-i 


i.e, the ‘centre of gravity” of the multiplet is not shifted. 


§39. The secular equation 


Let us now turn to the case where the unperturbed operator H, has de- 
generate eigenvalues. We denote by ¢,,, ¥,,(, ... the eigenfunctions be- 
longing to the same eigenvalue £,” of the energy. The choice of these func- 
tions is, as we know, not unique; instead of them we can choose any s (where 
s is the degree of degeneracy of the level E,,(°) independent linear combina- 
tions of these functions. The choice ceases to be arbitrary, however, if we 
subject the wave functions to the requirement that the change in them under 
the action of the small applied perturbation should be small. 

At present we shall understand by ¢,,, ¢,,., .-. some arbitrarily selected 
unperturbed eigenfunctions. The correct functions in the zeroth approxima- 
tion are linear combinations of the form 


Cn en + cn Opn... 
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The coefficients in these combinations are determined, together with the 
corrections in the first approximation to the eigenvalues, as follows. 

We write out equations (38.4) with k = n, n’, ..., and substitute in them, 
in the first approximation, E = E,°4 EF); for the quantities c, it suffices 
to take the zero-order values c, = Cp, Cp = Cn, ... 5 Ca = 0 for m #7, 


Z 


n’,.... We then obtain 
E®c, O — z Van’ Cp 


or 
E (Van! —E%8 nn’ Jon = 0, (39.1) 


where n, n’ take all values denumerating states belonging to the given un- 
perturbed eigenvalue F,,°. This system of homogeneous linear equations 
for the quantities c,‘ has solutions which are not all zero if the determinant 
of the coefficients of the unknowns vanishes. Thus we obtain the equation 


[Ve ee | (39.2) 


This equation ts of the sth degree in E® and has, in general. s different real 
roots. These roots are the required corrections to the eigenvalues in the first 
approximation. Equation (39.2) is called the secular equation.t We notice 
that the sum of its roots is equal to the sum of the diagonal matrix elements 
Vans Varn» (this being the coefficient of [E@]5-! in the equation). 

Substituting in turn the roots of equation (39.2) in the system (39.1) and 
solving, we find the coefficients c,‘ and so determine the eigenfunctions 
in the zeroth approximation. 

As a result of the perturbation, an originally degenerate energy level 
ceases in general to be degenerate (the roots of equation (39.2) are in general 
distinct); the perturbation removes the degeneracy, as we say. The removal 
of the degeneracy may be either total or partial (in the latter case, after the 
perturbation has been applied, there remains a degeneracy of degree less than 
the original one). 

It may happen that for some reason all the matrix elements are particularly 
smali (or even zero) for transitions within a group of mutually degenerate 
states z, n', .... It may then be useful to take into account not only in the first 
order the matrix elements Vpn. but also in the higher orders the matrix 
elements Vym (m 4 n,n’, ...) for transitions to states with a different energy. 
Let us do this for the matrix elements V,,,, in the second order. 

In equation (38.4) with k = n we put onthe left E = Ep% + E, (retaining 
the notation E® for the correction to the energy in the approximation 


considered), and replace cn by cy. Since cp = Oforallm # n,n’, ..., we 
have 


EM, = X Vamem® + ` Vome t. (39.3) 
m n' 


mmm 


t The name is taken from celestial mechanics. 
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The equations (38.4) with k = m # n, n', ... give as far as the first-order terms 


(En — Em Jem! = py ree U 


n 


whence 


Finaw 
ety = a HO, 
~ Ep © — Ep (0 
n 


Substitution in (39.3) gives 


| CERTEN AIREA 
EMA = > en yy ae 
” a (Van - EO ra) 
m 


n’ n 
These equations replace (39.1); the condition for them to be compatible 
again leads to the secular equation, which differs from (39.2) by the change 


La 
pa nam mn 
Vonn: > Van + > EOE O (39.4) 


PROBLEMS 


PROBLEM 1. Determine the corrections to the eigenvalue in the first approximation and 
the correct functions in the zeroth approximation, for a doubly degenerate level. 


SoLuTION. Equation (39.2) here has the form 
| Ue ees Vie 
| Va Va- E i 


‘the suffixes 1 and 2 correspond to two arbitrarily chosen unperturbed eigenfunctions y,® 
and wo!) of the degenerate level in question). Solving, we find 


EQ) = 43{(Vir+ F22) + ho], (1) 


Il 


with the notation 


fiw) = 4/{(Vis — Vor)? +4| V121?) 


tt 


for the difference between the two values of the correction E. Solving also equations (39.1) 
with these values of E")), we obtain for the coefficients in the correct normalized function in 
the zeroth approximation, ¥! = e ® p +e, Y, the values 


ca = Vas fi 4 | i 
2| Fizl ħut) 


+ Va fiz E u Fep 
2i Viel iw) 


PROBLEM 2. Derive the formulae for the correction to the eigenfunctions in the first 
approximation and to the eigenvalues in the second approximation. 


SOLUTION. We shall suppose that the correct functions in the zeroth approximation are 
chosen as the functions ¥,{°). The matrix Vnan. defined with respect to these is clearly diagonal 





(2) 
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with respect to the suffixes n, n’ (belonging to the same group of functions of a degenerate 
level), and the diagonal elements Voy, Van, are equal to the corresponding corrections 
E, E,., ... in the first approximation. 

Let us consider a perturbation of the eigenfunction ¢n'), so that in the zeroth approxima- 
tion E = En, c&n = 1, cm = O form # n. Inthe first approximation E = E, + Van, 
Cn = leg, Cm = Cm). We write out from the system (38.4) the equation with k Æ n,n’,.. 
retaining in it terms of the first order: 


at 


( E,9—E,)c,% = Vine! = Vin 
whence 
cE, = Vyn (EnO — EM) for k # 1, 1’, .., (1) 


Next we write out the equation with k = n’, retaining in it terms of the second order: 
E Yep = Fan’ Ca P = Vat mm 


(the terms with m = n, n’, ... dre omitted in the sum over m). Substituting E, = V,a and 
the expression (1) for cm), we obtain for n # n 


l 3 Vaal n 
60 ee — > ee (2) 


Via Van a E,—E,,(0 
(In this approximation the coefficient cnf?) is zero.) Formulae (1) and (2) determine the 
correction Yn = Ecm‘ dn to the eigenfunctions in the first approximation.t 

Finally, writing out the second-order terms in equation (38.4) with k = n, we obtain for 
the second-order corrections to the energy the formula 





j VT nn 
23 a 
E,” = 2 E00 E,(’ (3) 


1n 


which is formally identical with (38.10). 


ProBLEM 3. At the initial instant t = 0, a system is in a state ¥,!°) which belongs to a 
doubly degenerate level. Determine the probability that, at a subsequent instant £, the 
system will be in the state g,'°) with the same energy; the transition occurs under the action 
of a constant perturbation. 


SOLUTION. We form the correct functions in the zeroth approximation, 


Y = Ch, + Cope, w= ehitis Ye, 


where c, C3; Cy’, Cy) are two pairs of coefficients determined by formulae (2) of Problem 1 
(for brevity, we omit the index (® on all quantities). 
Conversely, 
Co p— cay’ 

‘i aa 
Cila — Ca Cy 
The functions ù and y¢’ belong to states with perturbed energies E+E) and E+E”, where 
E™ and ECY are the two values of the correction (1) in Problem 1. On introducing the time 
factors we pass to the time-dependent wave functions: 


enti /MEt = 
Y, = — je heme cyf et IME” 4 
Clg — Cr Cg 





_t Note that the condition for the quantities (1) and (2) to be small (and therefore the condi- 
tion for this method of perturbation theory to be applicable) again requires the conditions 
(38.9) to be satisfied only for transitions between states belonging to different energy levels. 
Transitions between states belonging to the same degenerate level are taken into account 
exactly (in a certain sense) by the secular equation. 
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(at time t = 0, Y, = y). Finally, again expressing y, ¢’ in terms of y}, Ya, we obtain Y, 
as a linear combination of y, and We, with coefficients depending on time. The squared modu- 
lus of the coefficient of Yẹ, determines the required transition probability w. Calculation 
with (1) and (2) from Problem 1 gives 


5}2 
wg = ———— {l -cos wtr} 


We see that the probability varies periodically with time, with frequency w™®. 

For times £ which are small compared with the period in question, the expression in the 
braces, and therefore wz, is proportional to t?: w, = |V\,.|22?/A?. This formula can be very 
simply obtained by the method given in the next section (using equation (40.4)). 


§40. Perturbations depending on time 


Let us now go on to study perturbations depending explicitly on time. We 
cannot speak in this case of corrections to the eigenvalues, since, when the 
Hamiltonian is time-dependent (as will be the perturbed operator H = H,+ 
+V(t)), the energy is not conserved, so that there are no stationary states. 
The problem here consists in approximately calculating the wave functions 
from those of the stationary states of the unperturbed system. 

To do this, we shall apply a method analogous to the well-known method 
of varying the constants to solve linear differential equations (P. A. M. Dirac 
1926). Let ‘VY, be the wave functions (including the time factor) of the 
stationary states of the unperturbed system. Then an arbitrary solution of the 
unperturbed wave equation can be written inthe form of asum Y = Za;'¥,. 
We shall now seek the solution of the perturbed equation 


ih OV et = (Ay + VYF (40.1) 
in the form of a sum 


E= puo, (40.2) 


where the expansion coefficients are functions of time. Substituting (40.2) 
in (40.1), and recalling that the functions ¥,©® satisfy the equation 


ih oF, Ojat = ALY, 


we obtain 
da, 
ih > Yo = > a, VY. 
Z k di - k k 


Multiplying both sides of this equation on the left by V,,‘©™ and integrating, 
we have 


hi 





oS eae (40.3) 
dt <4 
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where 
Von: = | EnO PH dg 


_ Em) -E0 
h ? 


= Vme emrt,  wmk 


are the matrix elements of the perturbation, including the time factor (and 
it must be borne in mind that, when V depends explicitly on time, the quanti- 
ties Vm also are functions of time). 

As the unperturbed wave function we take the wave function of the nth 
stationary state, for which the corresponding values of the coefficients in 
(40.2) are a, = 1, a, = 0 for k # n. To find the first approximation, 
we seek a, in the form a, =a, +a, P, substituting a, =a, on the 
right-hand side of equation (40.3), which already contains the small quantities 
Vink» This gives 


ih da,™/dt = Vpt). (40.4) 


In order to show the unperturbed function to which the correction is being 
calculated, we introduce a second suffix in the coefficients ap, writing 


Yn = 2 apn(t) F. 


Accordingly, we write the result of integrating equation (40.4) in the form 
Aunt? = —(ifh) f Vaglt) dt = (ilh) | Vineet de. (40.5) 


This determines the wave functions in the first approximation. 
Let us now consider in more detail the important case of a perturbation 
which is periodic with respect to time, of the form 


DP = feit 4 Gaot, (40.6) 


where F and G are operators independent of time. Since Ŷ is Hermitian, 
we must have 


Fe-tot + Getut = Fretot4 G te-tot, 
whence G = F+, i.e. 
Gam = Finn*. (40.7) 


This relation shows that 


Vien(t) = V pretexnt = Fynetlorn—o)t 4. Fy Melon tat, (40.8) 
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Substituting in (40.5) and integrating, we obtain the following expression for 
the expansion coefficients of the wave functions: 


Fyne rnt F pt Okto 
akd = ——_______ — —_____—__. (40.9) 
hi(Wyn—) (wnt w) 


These expressions are applicable if none of the denominators vanishes,t i.e. 
if for all k (and the given n) 


E,O—E,O Z the. (40.10) 


In a number of applications it is useful to have expressions for the matrix 
elements of an arbitrary quantity f, defined with respect to the perturbed 
wave functions. In the first approximation we have 


Fal?) 1 Jam) + fa aU; 


where 
fan) = | EOE dg = faman, 
PaO = f Ero, 


Substituting here Y, = E a,,V,, with apn? determined by formula 
(40.9), it is easy to obtain the required expression 


(0) (0) 
fant) = -emnt D Sor Erm Sam Ene ett 
k A(wrm—w) hi(Wyn+w) 


n aan » mO Fn y ; 
Sait Fenn Sim F nk a F (40.11) 


A(Wpm+tw)  A(Wyn—w) 


This formula is applicable if none of its terms becomes large, i.e. if none of 
the frequencies wpn, wem 1s too close to w. For w = 0 we return to formula 
(38.12). 

In all the formulae given here, it is understood that there is only a discrete 
spectrum of unperturbed energy levels. However, these formulae can be 
immediately generalized to the case where there is also a continuous spectrum 
(as before, we are concerned with the perturbation of states of the discrete 
spectrum); this is done by simply adding to the sums over the levels of the 
discrete spectrum the corresponding integrals over the continuous spectrum. 
Here it is necessary for the denominators w;,+ in formulae (40.9), (40.11) 
to be non-zero when the energy E,, takes all values, not only of the discrete 
but also of the continuous spectrum. If, as usually happens, the continuous 





} ; st 1 
+ More precisely, if none is so small that the quantities axn") are no longer small compared 


with unity- 
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spectrum lies above all the levels of the discrete spectrum, then, for instance, 
the condition (40.10) must be supplemented by the condition 


ExiL—E, > hw, (40.12) 


where Emin” is the energy of the lowest level of the continuous spectrum. 


PROBLEM 


Determine the change in the mth and mth solutions of Schrédinger’s equation in the 
presence of a periodic perturbation (of the form (40.6)), of frequency w such that E,,(° — E, 
= h(w+e), where e is a smal] quantity. 


SOLUTION. The method developed in the text is here inapplicable, since the coefficient 
Amn’) in (40.9) becomes large. We start afresh from the exact equations (40.3), with V4.(t) 
given by (40.8). It is evident that the most important effect is due to those terms, in the 
sums on the right-hand side of equations (40.3), in which the time dependence is determined 
by the small frequency waa—w. Omitting all other terms, we obtain a system of two equa- 
tions: 


thda,jdt = F mnte ia pm. 
We make the substitution 


ARN 
aret b; 


and obtain the equations 
thd, = Fmnbn, th(b,—teb,) = Fmn*Qm- 


Eliminating a,,, we have 


6, —tebat-|Fanl2b,'h? = 0. 


We can take as two independent solutions of these equations 


a, = At, am = —Aliz,e!'F,,,.* (l) 
and 


a, SBA aa = Bio n, (2) 


where A and B are constants (which have to be determined from the normalization condition), 
and we have used the notation 


Se. oe wee 
Q = V(fet+\n]2). q = Fralh. 


Thus, under the action of the perturbation, the functions Af), Ym become a,'¥,' + 
FamF n, with a, and am given by (1) and (2). 
Let the system be in the state ¥_,{") at the initial instant (t = 0). The state of the system 


at subsequent instants is given by a linear combination of the two functions which we have 
obtained, which becomes Fn for t = 0: 


Y = etti2( cos Qt = zesin A) Fn — (?n*/OQ)e —iet/2 sin Qt Š Fa, (3) 


T 
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The squared modulus of the coefficient of Y, is 


2 
Maa - cos 291) (4) 


This gives the probability of finding the system in the state Y0 at time 2. We see that itisa 
periodic tunction with frequency 20, and varies from 0 to |n|?/Q?. 
For e = 0 (exact resonance) the probability (4) becomes 


A(1 — cos 2|y]2). 


It varies periodically between 0 and 1; in other words, the system makes periodic transitions 
from the state ‘Yin to the state ¥',(. 


§41. Transitions under a perturbation acting for a finite time 


Let us suppose that the perturbation V(t) acts only during some finite 
interval of time (or that V(t) diminishes sufficiently rapidly as £ > +œ). 
Let the system be in the mth stationary state (of a discrete spectrum) before 
the perturbation begins to act (or in the limit as t > —oo). At any subsequent 
instant the state of the system will be determined by the function 


P = Lag, O, 
k 


where, in the first approximation, 


r 


1 
an, = Akn” = >} | V pne een! dt for k Æ "n, 


Jä (41.1) 
Ann = 1+a,,% = 1—- Po dt; 


g0 


the limits of integration in (40.5) are taken so that, as tf > — œ, all the 
akn“) tend to zero. After the perturbation has ceased to act (or in the limit 
t > + œ), the coefficients ouy take constant values apn( oo), and the system 
is in the state with wave function 


Y = a arn (OFO, 


which again satisfies the unperturbed wave equation, but is different from 
the original function ¥,,°. According to the general rule, the squared 
modulus of the coefficient a,,(00) determines the probability for the system 
to have an energy E, i.e. to be in the Ath stationary state. 
Thus, under the action of the perturbation, the system may pass from its 
initial stationary state to any other. The probability of a transition from 
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the initial (zth) to the final ( f th) stationary state ist 


ra e) 


A i, Se 
m= = | Vpeont dt (41.2) 


Let us now consider a perturbation which, once having begun, continues 
to act for an indefinite time (always, of course, remaining small). In other 
words, V(ż) tends to zero as t > —oo and to a finite non-zero limit as £ > 
+œ. Formula (41.2) cannot be applied directly here, since the integral in 
it diverges. This divergence, however, is physically unimportant and can 
easily be removed. To do this, we integrate by parts: 


Fi] 


t t 
r è lo it t ° tw it 
arti = E V jelorit dz = — a ++ OV i emi dt. 
f f 
h nosy Jo Ct hws 
(>a) 


-%0 = 





The value of the first term vanishes at the lower limit, while at the upper 
limit it is formally identical with the expansion coefficients in formula (38.8); 
the presence of an additional periodic factor e#~,;£ is merely due to the fact 
that the af; are the expansion coefficients of the complete wave function Y, 
while the cy; in §38 are the expansion coefficients of the time-independent 
function ys. Hence it is clear that its limit as £ > œ gives simply the change 
in the original wave function ¥; under the action of the “‘constant’’ part 
V(+ œ) of the perturbation, and consequently has no relation to transitions 
into other states. The probability of a transition is given by the squared 
modulus of the second term and is 





hew? | | 


z 2 
Wri = I |= eopit dt . (41.3) 


The derivation is also valid when the transition is from a state of the discrete 
spectrum to a state of the continuous spectrum. The only difference is that 
here we have the probability of the transition from a given (ith) state to states 
in a range of values of vy (see the end of §38) from vy to vf + dvg, so that, for 
example, formula (41.2) must be written 


Kew 


dw; = 


t = 9 
a | V petorit dt dvr. (41.4) 
3 


If the perturbation V() varies little during time intervals of the order of 
the period 1/wp; the value of the integral in (41.2) or (41.3) will be very 





t For uniformity, the initial and final states will henceforward be denoted by i and f when 
transition probabilities are discussed. The suffixes of these probabilities will be written in 
the order fi, the same as for matrix elements. 
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small. In the limit when the applied perturbation varies arbitrarily slowly, the 
probability of any transition with change of energy (i.e. with a non-zero 
frequency wyi) tends to zero. Thus, when the applied perturbation changes 
sufficiently slowly (adiabatically), a system in any non-degenerate stationary 
state will remain in that state (see also §53). 

In the opposite limiting case of a very rapid, “instantaneous” application of 
the perturbation, the derivatives €V»;/€t become infinite at the ‘instant of 
application”. In the integral of (êV p/ôtjeient, we can take outside the 
integral the comparatively slowly varying factor e‘»;,! and use its value at 
this instant. The integral is then found at once, and we obtain 


Wr = | V pg]? /h2w p? (41.5) 


The transition probabilities in instantaneous perturbations can also be 
found in cases where the perturbation is not small. Let the system be in a 
state described by one of the eigenfunctions 4; of the original Hamiltonian 
Ho. If the change in the Hamiltonian occurs instantaneously (i.e. in a time 
short compared with the periods 1/w,; of transitions from the given state i 
to other states), then the wave function of the system is “unable” to vary and 
remains the same as before the perturbation. It will no longer, however, be 
an eigenfunction of the new Hamiltonian A of the system, i.e. the state 
yi will not be a stationary state. The probabilities wp; for transitions of 
the system into the new stationary states are determined, according to the 
general rules of quantum mechanics, by the coefficients in the expansion of the 
function y; in terms of the eigenfunctions yp of the Hamiltonian Ê: 


wy = | | Pay dgl®. (41.6) 


We shall show how this general formula becomes (41.5) if the change 
V = H—AH) in the Hamiltonian is small. We multiply the equations 


opi = BOY, Ary = Ep” 


by wr* and Y; respectively, integrate with respect to g and subtract. 
Using also the self-conjugacy of the operator H, we obtain 


(Ey— Ei) | bp dg = f it Py dg. 


If the perturbation V is small, in the first approximation we can replace 
Eş by the adjoining unperturbed level E;©, and the wave function #y (on 


the right-hand side of the equation) by the corresponding function p,'°. 
This gives 


1 
#10) dg = —— ()*7y,,(0 dg, 
| wp hi dg kee | by y 


and formula (41.6) becomes (41.5). 


§41 Transitions under a perturbation acting for a finite time 149 
PROBLEMS 


PROBLEM 1. A uniform electric field is suddenly applied to a charged oscillator in the 
ground state. Determine the probabilities of transitions of the oscillator to excited states 
under the action of this perturbation. 


SoLuTion. The potential energy of the oscillator in the uniform field (which exerts a 
force F on it) is 


U(x) = dmat Fx 


= (mw*(x — xu) + constant 


(where xo = F mw"), i.e. has still the pure oscillator form but with the equilibrium position 
shifted. Hence the wave functions of the stationary states of the perturbed oscillator are 
Ye(x—xo), where p(x) are the oscillator functions (23.12); the initial wave function is 
¥o(x) (23.13). Using these functions and the expression (23.11) for the Hermite polynomials, 
we find 








i 1): f o d 
| Loe da = ( a 1] e-S$0—_e-E- E50 de 
ri \ (Erk!) dgk ài 


with the notation o = xoy (mw ñi). On integrating k times by parts, the integral ôn the 
right becomes 
ms 
Čo! | e-se dé = Ey y TOST. 1 


~X 


Thus the transition probability (41.6) is 


ke | 
wo, = eu k = tég? = F2/2mhes. 


As a function of the number & it represents a Poisson distribution for which the mean value 
of kis k. b 
Perturbation theory is applicable when F is small, so that k <1. Then the excitation 


probabilities are small, and decrease rapidly with increasing k. The largest is w ~ R. 
In the opposite case of large F (k > 1), excitation of the oscillator occurs with very high 
probability: the probability that the oscillator will remain in the normal state is woo = e7*. 


PROBLEM 2. The nucleus of an atom in the normal state receives an impulse which gives 
it a velocity v; the duration r of the impulse is assumed short in comparison both with the 
electron periods and with a/v, where ais the dimension of the atom. Determine the probability 
of excitation of the atom under the influence of such a “jolt” (A. B. Migdal 1939). 


SOLUTION. We use a frame of reference K’ moving with the nucleus after the impact. 
By virtue of the condition 7 < a/v, the nucleus may be regarded as practically stationary 
during the impact, so that the coordinates of the electrons in K’ and in the original frame K 
immediately after the perturbation are the same. The initial wave function in K’ is 


Yo’ = dp exp(—iq. X Ta), q = mvh, 


where yo is the wave function of the normal state with the nucleus at rest, and the summation 


in the exponent is over all Z electrons in the atom. The required probability of transition 
to the kth excited state is now given, according to (41.6), by 


wko = |<k] exp (—-1q. x ¥q)|O>|2. 


In particular, if ga <1, then by expanding the exponential factor in the integrand and noting 
that the integral of Yk#*ýo is zero because the functions wo and yx are orthogonal, we obtain 


wko = |<kjią . 2 ra)|0>j?. 
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PROBLEM 3. Determine the total probability of excitation and ionization of an atom of 
hydrogen which receives a sudden “‘jolt’’ (see Problem 2). 


SOLUTION. ‘The required probability can be calculated as the difference 
1— wo = 1—| [ goear aV}, 


where wo 1s the probability that the atom will remain in the ground state (Ya = (na?) -Yera 
being the wave function of the ground state of the hydrogen atom, with a the Behr radius) 
Calculation of the integral gives 


1—woo = 1—1/(1 +4 9202), 


In the limiting case ga < 1 this probability tends to zero as g?a?, while for ga ® 1 it tends to 
unity as 1 —(2/ga)®. 

PROBLEM 4. Determine the probability that an electron will leave the K-shell of an atom 
with large atomic number Z when the nucleus undergoes B-decay. The velocity of the 


B-particle is assumed large in comparison with that of the K-electron (A. B. Migdal and 
E. L. Feinberg 1941). 


SOLUTION.{ In the conditions stated the time taken by the -particle to pass through the 
K-shell1s small compared with the period of revolution of the electron, so that the change in 
the nuclear charge can be regarded as instantaneous. The perturbation is here represented 
by the change V = 1/7 in the field of the nucleus when the change in its charge is small 
(1 compared with Z). According to (41.5) the transition probability for one of the two K-shell 
electrons with energy Eo = —}Z? (here and below we use the fact that the state of the K- 
electrons is hydrogen-like; see §74) to a state of the continuous spectrum with energy 
E = łk? in the range dE = k dk is 


4 Vogl? 
(k?-+ Z2)? 


In the range which determines the matrix elemerit Vox, the important part is that of short 
distances (~1/Z) from the nucleus, in which the hydrogen-like expression can again be used 
for the wave function of a state of the continuous spectrum. The final state of the electron 
must have angular momentum / = 0 (the same as that of the initial state). By means of the 
functions Rio, and Rro (normalized on the &/2z scale), derived in §36 and formula (f.3) in the 
Mathematical Appendices we find{ 


i 4a/(2mk) (1 +0k/Z)'Z/4(1 —k/Z)-tz 
(-) iv 4/(1--e72nZ/k) 14 k2/Z2 


and, since \(1+2a)!/¢]2 = exp[—(2/a) tan! a}, 


27 
f(R/Z)k dk, 


we obtain finally du’ = Z4(1 + 2/Z2)4 


] 
with f(x) = Co exp[—(4/a) tanta] 


The limiting values of the function f(a) are e~4 for « €1 and a/2r for a > 1. 
'The totai probability of 1onization of the K-shell is obtained by integration of dw over all 
energies of the emergent electron. A numerical evaluation gives w = 0.65272. 


PROBLEM 5. Determine the probability of emergence of an electron from the K-shell 
of an atom with large Z in «-decay of the nucleus. The velocity of the «-particle is small 





+ In Problems 4 and 5, atomic units are used. . | 
t In the calculation it is convenient to use Coulomb units and then return to atomic units 


in the final result. 
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compared with that of the K-electron, but the time which it takes to leave the nucleus is small 
in comparison with the time of revolution of the electron (A. B. Migdal 1941, J. S. Levinger 
1953). 


SOLUTION. After the emergence of the a-particle, the perturbation acting on the electron 
is adiabatic. The required effect is therefore determined essentially by the interval of time 
close to the “instant of application’’ of the perturbation which destroys the adiabaticity, when 
the «-particle, leaving the nucleus and moving freely, is still at a distance small compared with 
the radius of the K-orbit. The perturbation V which causes the ionization of the atom is 
here represented by the deviation of the combined field of the nucleus and the «-particle 
from the purely Coulomb field Z/r. The dipole moment of two particles with atomic weights 
4 and A—4, and charges 2 and Z—2, at a distance vt apart (where v is the relative velocity of 


the nucleus and the a-particle), is 
HAAZ A= 22) 
rt ee 


A A 


Hence the dipole term in the field of the nucleus and the «-particle is+ 





where the z-axis is in the direction of the velocity v. The matrix element of this perturbation 
reiuces to that of z: taking the matrix element of the equation of motion of the electron 
z = —Z2z/r3, we obtain 


(=/r3)ox = (E— Eo)*=04/Z. 


The required transition probability for one of the two electrons in the K-shell is, by (41.2), 


[a 5] 


` 2 
da = 2| | Foxe*#e-£" di] dk 


0 
S(A—2Z}v? dk 
= ————| zaf —;} 
A?Z? 2r 


to calculate the integral, we include in the integrand an additional damping factor e~*¢ with 
A > 0, and then make A —> 0 in the result. To calculate the matrix element of = = r cos 7, 
we note that, since the orbital angular momentum in the initial state is / = 0, cos 8 has a non- 
zero matrix element only for the transition to a state with l = 1, and 


08 Ojj? = 4 
cai [(cos oil” = 3 
|sox|* =- 3lrox|"- 
Calculating rox by means of the radial functions Roo and Rei, we find 
21,A—2Z)?22 
ey = —________ f(k/Z)k dh, 
3AZZ5(1 + kh? /Z-)5 


the function f being as in Problem 4. 


§42. Transitions under the action of a periodic perturbation 


The results are different for the probability of transitions to the states of 
the continuous spectrum under the action of a periodic perturbation. Let 





t If the difference Æ —2Z is small, it may be necessary to take account of the next (quad- 
rupole) term also. 
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us suppose that, at some initial instant ¢ = 0, the system is in the ¿th station- 
ary state of the discrete spectrum. We shall assume that the frequency w of 
the periodic perturbation is such that 


hw > Emin — Ei, (42.1) 


where Emin is the value of the energy where the continuous spectrum begins. 

It is evident from the results of §40 that the chief part will be played by 
states of the continuous spectrum with energies E; very close to the resonance 
energy E: + hw, i.e. those for which the difference wp; —w is small. For this 
reason it is sufficient to consider, in the matrix elements (40.8) of the pertur- 
bation, only the first term (with the frequency wf: —w close to zero). Sub- 
stituting this term in (40.5) and integrating, we obtain 


t 
etlws;-w)t — J] 


afi = a V flt) dt = — Ff 


0 


j 42.2 
h(wfi— w) ne 


The lower limit of integration is chosen so that aşı = 0 for t = 0, in accord- 
ance with the initial condition imposed. 
Hence we find for the squared modulus of ay; 


layi|* = [E rl? .4 sin2[4(w;j —w)t] (esi —w)?, (42.3) 


It is easy to see that, for large ż, this function can be regarded as propor- 
tional to t. To show this, we notice that 





lim — = ŝa). (42.4) 


For when « Æ 0 this limit is zero, while for 2 = 0 we have (sin?at)/te? = t, 
so that the limit is infinite; finally, integrating over « from —œ to +0, 
we have (with the substitution at = &) 


i Rane E n 
: f E Sa { de 1. 
ny E 


= ta? 
— 00 








Thus the function on the left-hand side of equation (42.4) in fact satisfies all 
the conditions which define the delta function. Accordingly, we can write 
for large ż 


jagi|? = (1/h?)| Fj rte desi — $e), 
or, substituting Awg = Ey—E; and using the fact that (ax) = (1/a)8(x), 
jasi]? = (2r jh)|F p| (Ez — Ei — hw)t. 
The expression |ayi|2 dvz is the probability of a transition from the original 


state to one in the interval dyf. We see that, for large £, it is proportional to the 
time interval elapsed since £ = 0. The probability dwz: of the transition per 
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unit time ist 


dw x; = (27ih) Erl SEs — E;© — hw) dvs. (42.5) 


As we should expect, it is zero except for transitions to states with energy 
Ey = E,{+hw. If the energy levels of the continuous spectrum are not 
degenerate, so that vy can be taken as the value of the energy alone, then the 
whole “‘interval”’ of states dv; reduces to a single state with energy E = E + 
+ ñw, and the probability of a transition to this state is 


wm = (27/h)| Fri|?. (42.6) 


There is another method of deriving formula (42.5) that is methodologically 
instructive, in which the periodic perturbation ts assumed not to be applied at 
atime tf = 0 but to increase slowly from t = — œ by an exponential law e#é 
with a positive constant A which is then made to tend to zero (adiabatic 
switch-on). The initial condition ay; = 0 is accordingly applied att = — œ. 
The matrix element of the perturbation now has the form 


V aP = a li 
and (42.2) becomes 


afi = — A | Vilt) dt 


ellwsj-a)t +t 


ee ay 42.7 
E a y eel 
Hence 
1 peut 
apil? = — |F plè ——___"__. 
an A fil (wi — w)? + À? 
The transition probability per unit time is given by the derivative 
djaz|2/dt = 2Alay|?. 
There is a formula 
e(a), (42.8) 


im ———_———. = 
a~o ma2+ À?) 
valid in the same sense as (42.4); with this we find, taking the limit A > 0, 


di, 2) T 
C lanl? > Z| Fallon- o), 


and thus return to (42.5). 





t It is easy to verify that, on taking account of the second term in (40.8), which we have 
omitted, additional expressions are obtained which, on being divided by £, tend to zero as 
ł > + œ. 
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§43. Transitions in the continuous spectrum 


One of the most important applications of perturbation theory is to calculate 
the probability of a transition in the continuous spectrum under the action 
of a constant (time-independent) perturbation. We have already mentioned 
that the states of the continuous spectrum are almost always degenerate. 
Having chosen in some manner the set of unperturbed wave functions cor- 
responding to some given energy level, we can put the problem as follows. 
It is known that, at the initial instant, the system is in one of these states; 
it is required to determine the probability of the transition to another state 
with the same energy. For transitions from the initial state z to states between 
vf and vy+ dv we have at once from (42.5) (putting œw = 0 and changing the 
notation) 

dws, = (27r/h)|V jal S(Es— Ei) dvp. (43.1) 


This expression is, as we should expect, zero except for Ey = E;: under 
the action of a constant perturbation, transitions occur only between states 
with the same energy. It must be noticed that, for transitions from states 
of the continuous spectrum, the quantity dw” cannot be regarded directly 
as the transition probability; it is not even of the right dimensions (1/time). 
Formula (43.1) represents the number of transitions per unit time, and its 
dimensions depend on the chosen method of normalization of the wave 
functions of the continuous spectrum.+ 

Let us calculate the perturbed wave function, which before the action of 
the perturbation is the same as the original unperturbed function y;. 
Using the method given at the end of §42, we can regard the perturbation as 
being adiabatically applied according to e4@ with A +0. From (42.7), 
putting w = 0 and changing the notation, we have 


EP {(2/h)(Ey— Ei)t + At} 


mop 
on í Be 


(43.2) 


The perturbed wave function is 
Af ist PUO) Í a DFO dvr, 


where the integration is extended over the whole of the continuous spectrum. f 
Substitution of (43.2) gives 


, ; dvs l ( i ) 
F, = | y0 Vir 0) expl -Eit \. (43.3) 
' E + | WE RE; eOl ih 


+ The phenomena comprised within the theory here discussed include, for example, various 
types of collision; the system in its initial and final states is a set of free particles and the 
perturbation is the interaction between them. With appropriate normalization of the wave 
functions, (43.1) may then be the collision cross-section (see §1 26). . 

t If there is also a discrete spectrum, then we must add to the integral in this formula (and 
subsequent ones) the appropriate sum over the states of the discrete spectrum. 
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In the limit as A > 0, the factor eżt becomes unity. The term +720, denoting 
the limit of iA as A tends to zero from positive values, determines the manner 
of integration with respect to the variable Ey (dE; occurs as a factor in dv¢ 
together with the differentials of other quantities which describe the states of 
the continuous spectrum). Without the term 7A, the integrand in (43.3) 
would have a pole at Ey = E;, near which the integral would diverge. The 
term 7A moves this poie into the upper half-plane of the complex variable Ey. 
After the limit A —> 0 is taken, the pole returns to the real axis, but we know 
that the path of integration must pass beneath it: 


E. 


ee eS (43.4) 
E 


f 


The time factor in (43.3) shows that this function belongs, as it should, to 
the same energy E; as the original unperturbed function. In other words, the 
function 


Vi 
aai | Ex Erro” á 


satisfies Schrédinger’s equation 


(Ho+ Vy = Epi. 


It is therefore natural that the expression obtained should correspond exactly 
to (38.8). 

The calculations given above correspond to the first approximation of 
perturbation theory. It is not difficult to calculate the second approximation 
as well. To do this, we must derive the formula for the next approximation 
to ¥;; this is easily effected by using the method of §38 (now that we know the 
method of dealing with the “divergent” integrals), A simple calculation 
gives the formula 


ij B e 
E ORRE VA | ee d | x 
i fy | | fi | aE v 


x He dor e-lh Eit, (43.5) 
E; — Ef+ 10 


Comparing this expression with formula (43.3), we can write down the 
corresponding formula for the probability (or, more precisely, the number) 


t With this formula, the way in which the integral is to be taken can be found from the 


condition that the asymptotic expression for pi at large distances should contain only an 
outgoing (and not an ingoing) wave (see §136). 


OmM3-F ® 
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of transitions, by direct analogy with (43.1): 


eae 


2 
— 2 dv (E; — Er) dvr. ; 
Pepa (E;— Ey) dv (43.6) 


dws; = = Vit | 


It may happen that the matrix element Vy; for the transition considered 
vanishes. The effect is then zero in the first approximation, and (43.6) 
becomes 

271 Vy Vu 2 
dwg = — | ——— dv &E;- Ej) dry. 
eS al ine (Ey — Ei) dvy (43.7) 


In applications of this formula, the point Ey = F; is not usually a pole of the 
integrand ; the manner of integrating with respect to E, is then unimportant, 
and the integral can be taken along the real axis. 

The states v for which Vf, and V,; are not zero are usually called inter- 
mediate states for the transition z —> f. Intuitively, we may say that this 
transition takes place as if in two steps? > vand v > f (but such a descriptio” 
must not be taken literally, of course). It may happen that the transition 
1 > f can take place not through one but only through several successive 
intermediate states. Formula (43.7) can be at once generalized to such cases. 
For example, if two intermediate states are needed, we have 


ar PER ONT ae i2 
donss S ee T dy CAE Oa: 43.8 
ü hom E da i oi al 


Lastly, to clarify the mathematical significance of the integrals taken along 
a path of the form (43.4), we shall prove the formula 


ioe LO) + inf (a), (43.9) 


x—-a—10 


where the integration is along a segment of the real axis including the point 
x = a. If we pass round the pole x = aalong a semicircle of radius p, we find 
that the whole integral is equal to the sum of the integrals along the real axis 
from the lower limit to a— pand from a+ p to the upper limit, together with 
im times the residue of the integrand at the pole. In the limit p > 0, the 
integrals along the real axis make the integral along the complete segment, 
taken as a principal value (denoted by P), and the result is (43.9), which may 
also be symbolically written 


leah a (el 


x—a—1i0 v-a 


+i78(x — a); (43.10) 





P here denotes the taking of the principal value when integrating the function 


f (*®)(* — @). 
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§44. The uncertainty relation for energy 


Let us consider a system composed of two weakly interacting parts. We 
suppose that it is known that at some instant these parts have definite values 
of the energy, which we denote by E and e respectively. Let the energy be 
measured again after some time interval At; the values E’, e’ obtained are 
in general different from F, e. It is easy to determine the order of magnitude 
of the most probable value of the difference E’ + e’ —E—e which is found asa 
result of the measurement. 

According to formula (42.3) with w = 0, the probability of a transition of 
the system (after time ¢), under the action of a time-independent perturbation, 
from a state with energy E to one with energy E’ is proportional to 


sin?{(E’—E)t/2h]/(E’ —E)*. 


Hence we see that the most probable value of the difference E’ —E is of the 
order of i/t. 

Applying this result to the case we are considering (the perturbation being 
the interaction between the parts of the system), we obtain the relation 


\E+e—E’—e'JAt ~R. (44.1) 


Thus the smaller the time interval At, the greater the energy change that is 
observed. It is important to notice that its order of magnitude #/At is inde- 
pendent of the amount of the perturbation. The energy change determined 
by the relation (44.1) will be observed, however weak the interaction between 
the two parts of the system. This result is peculiar to quantum theory and has 
a deep physical significance. It shows that, in quantum mechanics, the law 
of conservation of energy can be verified by means of two measurements only 
to an accuracy of the order of h/ At, where At is the time interval between the 
measurements. 

The relation (44.1) is often called the uncertainty relation for energy. How- 
ever, it must be emphasized that its significance is entirely different from 
that of the uncertainty relation A\pAx ~ Å for the coordinate and momen- 
tum. In the latter, Ap and Ax are the uncertainties in the values of the 
momentum and coordinate at the same instant; they show that these two 
quantities can never have entirely definite values simultaneously. The 
energies E, e, on the other hand, can be measured to any degree of accuracy 
at any instant. The quantity (E+e«)—(E’ +e’) in (44.1) is the difference 
between two exactly measured values of the energy E+e at two different 
instants, and not the uncertainty in the value of the energy at a given instant. 

If we regard E as the energy of some system and e as that of a ‘‘measuring 
apparatus’, we can say that the energy of interaction between them can be 
taken into account only to within h/At. Let us denote by AE, Ae, ... the 
errors in the measurements of the corresponding quantities. In the favour- 
able case when e, e' are known exactly (Ae = Ae’ = 0), we have 


A(E—E’) ~ hij At. (44.2) 
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From this relation we can derive important consequences concerning the 
measurement of momentum. The process of measuring the momentum of a 
particle (for definiteness, we shall speak of an electron) consists in a collision 
of the electron with some other (‘‘measuring’”’) particle, whose momenta 
before and after the collision can be regarded as known exactly.t If we apply 
to this collision the law of conservation of momentum, we obtain three equa- 
tions (the three components of a single vector equation) in six unknowns 
(the components of the momentum of the electron before and after the col- 
lision). The number of equations can be increased by bringing about a 
series of further collisions between the electron and ‘‘measuring”’ particles, 
and applying to each collision the law of conservation of momentum. This, 
however, increases the number of unknowns also (the momenta of the electron 
between collisions), and it is easy to see that, whatever the number of col- 
lisions, the number of unknowns wil! always be three more than the number 
of equations. Hence, in order to measure the momentum of the electron, 
it is necessary to bring in the law of conservation of energy at each collision, 
as well as that of momentum. The former, however, can be applied, as we 
have seen, only to an accuracy of the order of #/At, where At is the time be- 
tween the beginning and end of the process in question. 

To simplify the subsequent discussion, it is convenient to consider an 
imaginary idealized experiment in which the “measuring particle” is a 
perfectly reflecting plane mirror; only one momentum.component is then 
of importance, namely that perpendicular to the plane of the mirror. To 
determine the momentum P of the particle, the laws of conservation of 
momentum and energy give the equations 


p'+P'—p—P =0, (44.3) 


le’ E’—e—E} ~ hifAt, (44.4) 


where P, E are the momentum and energy of the particle, and p, «e those of 
the mirror; the unprimed and primed quantities refer to the instants before 
and after the collision respectively. The quantities p, p’, e, e’ relating to the 
“measuring particle” can be regarded as known exactly, i.e. the errors in 
them are zero. Then we have for the errors in the remaining quantities, 
from the above equations: 


AP = AP’, |AE’—AE|~A/AL. 
But AE = (@E/aP)AP = vAP, where v is the velocity of the electron (before 
the collision), and sirnilarly AE’ = v'AP’ = v'AP. Hence we obtain 
(v .—v,)AP,|~A/AL. (44.5) 


We have here added the suffix x to the velocity and momentum, in order to 
emphasize that this relation holds for each of their components separately. 
This is the required relation. It shows that the measurement of the 


+ In the present analysis it is of no importance how the energy of the “measuring” particle 
is ascertained. 
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momentum of the electron (with a given degree of accuracy AP) necessarily 
involves a change in its velocity (i.e. in the momentum itself). This change 
becomes greater as the duration of the measuring process becomes shorter. 
The change in velocity can be made arbitrarily small only as At > æ, but 
measurements of momentum occupying a long time can be significant only 
for a free particle. The non-repeatability of a measurement of momentum 
after short intervals of time, and the “two-faced” nature of measurement in 
quantum mechanics—the necessity of a distinction between the measured 
value of a quantity and the value resulting from the process of measurement— 
are here exhibited with particular clarity.T 

The conclusion reached at the beginning of this section, which was based 
on perturbation theory, can also be derived from another standpoint by con- 
sidering the decay of a system under the action of some perturbation. Let 
E, be some energy level of the system, calculated without any allowance for 
the possibility of its decay. We denote by 7 the lifetime of this state of the 
system, i.e. the reciprocal of the probability of decay per unit time. Then 
we find by the same method that 


|E, —E—e| ~ hjr, (44.6) 


where ÈE, e are the energies of the two parts into which the system decays. 
The sum E'-+e, however, gives us an estimate of the energy of the system 
before it decays. Hence the above relation shows that the energy of a system, 
in some “‘quasi-stationary”’ state, which is free to decay can be determined 
only to within a quantity of the order of |7. This quantity is usually called 
the width I of the level. Thus 


Pe. (44.7) 


§45. Potential energy as a perturbation 


The case where the total potential energy of the particle in an external 
field can be regarded as a perturbation merits special consideration. The 
unperturbed Schrédinger’s equation is then the equation of free motion of 
the particle: 


Apo Ryo =0, k = 4/(2mE/h) = pli, (45.1) 


and has solutions which represent plane waves. The energy spectrum of 
free motion is continuous, so that we are concerned with an unusual case of 
perturbation theory in a continuous spectrum. The solution of the problem 
is here more conveniently obtained directly, without having recourse to 
general formulae. 


The equation for the correction ¥® to the wave function in the first ap- 
proximation is 
AYL k2YO = (2m U/ 2) y/o, (45.2) 


t The relation (44.5) and the elucidation of the 


physical significance of the uncertainty 
relation for energy are due to N. Bohr (1928). 
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where U is the potential energy. The solution of this equation, as we know 
from electrodynamics, can be written in the form of retarded poteniials, i.e. 
in the formt 


V(x, y, 2) = —(m/2nh?) | YOU (x, y’, z'er AV" |r, (45.3) 


where 
dV’ = dx'dy’d2’, r? = (x—x’P+(y—y’)?+(2—2’)?. 


Let us find what conditions must be satisfied by the field U in order that 
it may be regarded as a perturbation. The condition of applicability of per- 
turbation theory is contained in the requirement that 44) <7, Let a be 
the order of magnitude of the dimensions of the region of space in which the 
field is noticeably different from zero. We shall first suppose that the energy 
of the particle is so small that ka is at most of the order of unity. Then the 
factor e#*r in the integrand of (45.3) is unimportant in an order-of-magnitude 
estimate, and the integral is of the order of 2 |U|a?, so that 


JO ~ mj Ulery, 
and we have the condition 


|U| < h?/ma? (fur ka = 1). (45.4) 


We notice that the expression on the right has a simple physical meaning; 
it is the order of magnitude of the kinetic energy which the particle would 
have if enclosed in a volume of linear dimensions a (since, by the uncertainty 
relation, its momentum would be of the order of A/a). 

Let us consider, in particular, a potential well so shallow that the condition 
(45.4) holds for it. It is easy to see that in such a well there are no negative 
energy levels (R. Peierls 1929); this has been shown, for the particular case 
of a spherically symmetric well, in §33, Problem. For, when E = 0, the 
unperturbed wave function reduces to a constant, which can be arbitrarily 
taken as unity: f@ = 1. Since J) <4) it is clear that the wave function 
$ = 1+%% for motion in the well nowhere vanishes; the eigenfunction, 
being without nodes, belongs to the normal state, so that E = 0 remains the 
least possible value of the energy of the particle. Thus, if the well is suffi- 
ciently shallow, only an infinite motion of the particle is possible: the particle 
cannot be “‘captured”’ by the well. Note that this result is peculiar to quantum 
theory; in classical mechanics a particle can execute a finite motion in any 
potential well. 

It must be emphasized that all that has been said refers only to a three- 
dimensional well. In a one- or two-dimensional well (i.e. one in which the 
field is a function of only one or two coordinates), there are always negative 


+ This is a particular integral of equation (45.2), to which we may add any solution of the 
same equation with zero on the right-hand side, i.e. the unperturbed equation (45.1). 
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energy levels (see the Problems at the end of this section), This is related to 
the fact that, in the one- and two-dimensional cases, the perturbation theory 
under consideration is inapplicable for an energy E which is zero (or very 
small). 

For large energies, when ka > 1, the factor e” in the integrand plays an 
important part, and markedly reduces the value of the integral. The solution 
(45.3) in this case can be transformed; the alternative form, however, is more 
conveniently derived by returning to equation (45.2). We take as x-axis the 
direction of the unperturbed motion; the unperturbed wave function then 
has the form ©) = e*** (the constant factor is arbitrarily taken as unity). 
Let us seek a solution of the equation 


AYO 4 Ry = (2mh?) Ueikz 


in the form Y = e'**f; in view of the assumed large value of k, it is suff- 
cient to retain in A") only those terms in which the factor e*** is differen- 
tiated one or more times. We then obtain for f the equation 


2ik flax = 2mUIh?, 
whence 


yD = eiktf = —(im/hPk)et**[ U dx. (45.5) 


An estimation of this integral gives |Y®| ~ m|U|a/h?k, so that the con- 
dition of applicability of perturbation theory in this case is 


|U| < (#?/ma*)ka = hv/a (ka > 1), (45.6) 


where v = khi/m is the velocity of the particle. It is to be observed that this 
condition is weaker than (45.4). Hence, if the field can be regarded as a 
perturbation at small energies of the particle, it can always be so regarded at 
large energies, whereas the converse is not necessarily true. f 

The applicability of the perturbation theory developed here to a Coulomb 
field requires special consideration. Ina field where U = afr, it is impossible 
to separate a finite region of space outside which U is considerably less than 
inside it. The required condition can be obtained by writing in (45.6) a 
variable distance r instead of the parameter a; this leads to the inequality 


ajhv < 1. (45.7) 





t In the two-dimensional case ¥” ıs expressed (as is known from the theory of the two- 
dimensional wave equation) as an integral similar to (45.3), in which, instead of eit" dx'dy'dz’jr 
we have t7H,\(kr) dx’dy’, where H” is the Hankel function and r? = (x—x’)?+(y—y’)?. 
As k > 0, the Hankel function, and therefore the whole integral, tend logarithmically to 
infinity. 

Similarly, in the one-dimensional case, we have, in the integrand, 27ie'*r dx’/k, where 
r = |x— x|, and as k > 0 y tends to infinity as 1/&, 

J In the one-dimensional] case the condition for perturbation theory to be applicable is 
given by the inequality (45.6) for all ka. The derivation of the condition (45.4) given above 
for the three-dimensional case is not valid in the one-dimensional case, 


4 l owing to the divergence 
of the resulting function ¢( (see the preceding footnote). 
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Thus, for large energies of the particle, a Coulomb field can be regarded as a 
perturbation.f 

Finally, we shall derive a formula which approximately determines the 
wave function of a particle whose energy E everywhere considerably exceeds 
the potential energy U (no other conditions being imposed). In the first 
approximation, the wave function depends on the coordinates in the same 
way as for free motion (whose direction is taken as the x-axis). Accordingly, 
let us look for ẹ in the form ẹ = e*2F, where F is a function of the co- 
ordinates which varies slowly in comparison with the factor e#** (but we 
cannot in general say that it is close to unity). Substituting in Schrédinger’s 
equation, we obtain for F the equation 


2tk OF [dx = (2m/h®) UF, (45.8) 
whence 
Y = e&k=F = constant x e*%e- Av Uaz, (45.9) 


This is the required expression. It should, however, be borne in mind that 
this formula is not valid at large distances. In equation (45.8) a term AF 
has been omitted which contains second derivatives of F. The derivative 
6°F/0x?, together with the first derivative @F/éx, tends to zero at large 
distances, but the derivatives with respect to the transverse coordinates y 
and z do not tend to zero, and can be neglected only if x < ka’. 


PROBLEMS 


PROBLEM 1. Determine the energy level in a one-dimensional potential well whose depth 
is small. It is assumed that the condition (45.4) is satisfied. 


SOLUTION. We make the hypothesis, which will be confirmed by the result, that the 
energy level |E| < |U|. Then, on the right-hand side of Schrédinger’s equation 


dyd = (2m/h*)[U(x)—E}y, 


we can neglect E in the region of the well, and regard ¢ as a constant, which without loss of 
generality can be taken as unity: 


GN, /dx? = 2mU/h? 


We integrate this equation with respect to x between two points +x, such that a < x, < 1/x, 
where a is the width of the well and x = +/(2m|E|/h?). Since the integral of U(x) converges, 
the integration on the right can be extended to the whole range from — œ to +œ: 


[= P =% f Yar (1) 


At large distances from the well, the wave function is of the form ¢ = e+*?. Substituting 
this in (1), we find 


—2n = (2m/ht) [ U dx 
+ It must be borne in mind that the integral (45.5) with a field U = ajr diverges (logarith- 


mically) when x/4/ (x? +2?) is large. Hence the wave function in a Coulomb field, obtained 
by A eane of perturbation theory, is inapplicable within a narrow cone about the x-axis. 
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or 


|E] = zr] | U axla 


We see that, in accordance with the hypothesis, the energy of the level is a small quantity of a 
higher order (the second) than the depth of the well. 


PROBLEM 2. Determine the energy level in a two-dimensional potential well U(r) (where 


r is the polar coordinate in the plane) of small depth; it is assumed that the integral [rU dr 
converges. 0 

SoLUTION. Proceeding as in the previous problem, we have in the region of the well the 
equation 


1 d/ dp 2m 
~—{ r—) = — U. 
r dr\ dr h? 


Integrating this with respect to r from 0 to 7, (where a < r, < 1/x), we find 


dy _ 2m r i 
[= TA = iy, (r) dr. (1) 


At large distances from the well, the equation of free motion in two dimensions is 


Ld y dy 2m, ; 
Ae T 


and has a solution (vanishing at infinity) y = ¢onstant x H,‘)(ixr); for small values of the 
argument, the leading term in this function is proportional to log xr. Bearing this in mind, we 
equate the logarithmic derivatives of ¢ for r ~ a inside the well (the right-hand side of (1)) 
and outside it, obtaining 


b 2m ¢ 
z — Í U'(r)r dr, 
Glogxa Ra : 


is e] 
2 a -1 
| E| ~ Fee |S | tr dr L 
ma? m | 
n 


We see that the energy of the level is exponentially small compared with the depth of the 
well. 





whence 


CHAPTER VII 


THE QUASP CLASSICAL GASH 


§46. The wave function in the quasi-classical case 


IF the de Broglie wavelengths of particles are small in comparison with the 
characteristic dimensions L which determine the conditions of a given 
problem, then the properties of the system are close to being classical, just as 
wave optics passes into geometrical optics as the wavelength tends to zero. 

Let us now investigate more closely the properties of guast-classical 
systems. To do this, we make in Schrédinger’s equation 


2 


h 
23 





A ab+(E— Oy = 0 


the substitution 


yy = die, (46.1) 


For the function o we obtain the equation 
5 1 5 ih 
2. — = ——s 
á Zma V 0) a E U. (46.2) 


Since the system is supposed almost classical in its properties, we seek o in 
the form of a series: 


o = co+(Ā/i)o,+(Ř/i) o+ ..., (46.3) 


expanded in powers of &. 
We begin by considering the simplest case, that of one-dimensional motion 
of a single particle. Equation (46.2) then reduces to 


o*/2m—tho” /2m = E— U(x), (46.4) 


where the prime denotes differentiation with respect to the coordinate x. 
In the first approximation we write o = o, and omit from the equation the 
term containing Å: 


og ?/2m = E— U(x). 
Hence we find 
oo = + | V{2m[E—U(s)]} dx. 
164 
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The integrand is simply the classical momentum p(x) of the particle, expres- 
sed as a function of the coordinate. Defining the function p(x) with the + 
sign in front of the radical, we have 


an Í pdx, p = v[2m{E—U)], (46.5) 


as we should expect from the limiting expression (6.1) for the wave function.t 
The approximation made in equation (46.4) is legitimate only if the second 
term on the left-hand side is small compared with the first, i.e. we must have 
Rio lo le lor 


\d(R/o’)/dx| < 1. 


In the first approximation we have, according to (46.5), o’ =p, so that the 
condition obtained can be written 


|d(A/2z)/dx| < 1, (46.6) 


where A(x) = 27h/p(x) is the de Broglie wavelength of the particle, expressed 
as a function of x by means of the classical function p(x). Thus we have 
obtained a quantitative quast-classicality condition: the wavelength of the 
particle must vary only slightly over distances of the order of itself. The 
formulae here derived are not applicable in regions of space where this condi- 
tion is not satisfied. 

The condition (46.6) can be written in another form by noticing that 


dp d m dU mF 
T = —y/[2m(E—U)] = — L =, 
dx dx p dx P 
where F = —dU/dx is the classical force acting on the particle in the external 
field. In terms of this force we find 
mh F\/p> < 1. (46.7) 


It is seen from this that the quasi-classical approximation becomes inapplic- 
able if the momentum of the particle is too small. In particular, it is clearly 
inapplicable near turning points, i.e. near points where the particle, according 
to classical mechanics, would stop and begin to move in the opposite direction. 
These points are given by the equation p(x) = 0, i.e. E = U(x). Asp +0, 
the de Broglie wavelength tends to infinity, and hence cannot possibly be 
supposed small. 

It must be emphasized, however, that the condition (46.6) or (46.7) alone 
may be insufficient for the quasi-classical approximation to be valid. The 
reason is that this condition has been derived from estimates of the various 
terms in the differential equation (46.4), the term omitted containing a higher 
derivative. It would be necessary, in fact, to stipulate the smallness of the 


+ As is well known, J p dx is the time-independent part of the action. The total mechanical 


action S of a particle is S = —Et+ Je dx. The term —Et is absent from o,, since we are 
considering a time-independent wave function 4. 
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subsequent expansion terms in the solution of this equation, and this need 
not be ensured by the smallness of the term omitted. For example, if the 
solution for o(x) contains a term which increases almost linearly with the 
coordinate x, the smallness of the second derivative in the equation will not 
prevent this term from becoming large at sufficiently great distances. Sucha 
situation occurs, in general, when the field extends to distances large in 
comparison with the characteristic length L over which it varies by an 
appreciable amount; see the discussion of (46.11) below. The quasi-classical 
approximation is then invalid for investigating the behaviour of the wave 
function at large distances. 

Let us now calculate the next term in the expansion (46.3). The first-order 
terms in f in equation (46.4) give 


oo 01 +46," = 0, 


whence 
o = —0 /20, = —p’/2p. 
Integrating, we find 
c, = —} log?, (46.8) 


omitting the constant of integration. 
Substituting this expression in (46.1) and (46.3), we find the wave function 
in the form 


Y = Cp Veet mip dz 4 Cp-l/2eHt nip dx, (46.9) 


The factor 1/+/p in this function has a simple interpretation. The proba- 
bility of finding the particle at a point with coordinate between x and x + dx 
is given by the square |y¥|?, i.e. is essentially proportional to 1/p. This is 
exactly what we should expect for a “‘quasi-classical” particle, since, in 
classical motion, the time spent by a particle in the segment dx is inversely 
proportional to the velocity (or momentum) of the particle. 

In the “classically inaccessible” parts of space, where E < U(x), the func- 
tion p(x) is purely imaginary, so that the exponents are real. The general form 
of the solution of the wave equation in these regions is 


Ci C2 eah f ipiaz, (46.10) 
vip] p| 


It must, however, be borne in mind that the accuracy of the quasi-classical 
approximation is not such as to allow the retention in the wave function of 
exponentially small terms superimposed on exponentially large ones, and in 
this sense it is usually not permissible to retain both terms in (46.10). 

Although there is, as a rule, no need to use the higher-order terms in the 
wave function, we shall derive the next term in the expansion (46.3), with a 
view to noting some aspects of the accuracy of the quasi-classical approxi- 
mation. 


e-l/ħ) | ipldz + 








yf = 
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The terms of order A? in equation (46.4) give 
Go Og +30,"+40," = 0, 


whence (substituting (46.5) and (46.8) for co and o1) 


~~ 


oz = p’/Ap?—3p?/8p". 


Integrating (by parts in the first term) and introducing the force F = pp’ |m, 
we obtain 


op = mF |p +m? | (F295) de. 
The wave function in this approximation is of the form 


yb = e/he — oli /ne,t,(1 _ific,) 
or 
constant 


VP 





p= [1—dimh F/p>—4ikim? | (F2/ps) dxJett/mfe ax, (46.11) 


The occurrence of imaginary correction terms in the coefficient of the 
exponential is equivalent to the presence of a similar correction in the phase of 
the wave function, i.e. of an addition to the integral (1/h) f p dx in its exponent. 
This correction is proportional to Ā, i.e. is of order A/L. 

The second and third terms in the brackets in (46.11) must be small in 
comparison with unity. For the second term, this condition is the same as 
(46.7); for the third term, an estimate of the integral gives (46.7) only if F2 
tends to zero sufficiently rapidly at distances ~ L. 


§47. Boundary conditions in the quasi-classical case 


Let x = a be a turning point, so that U(a) = E, and let U > E for all 
x > a, so that the region to the right of the turning point is classically 
inaccessible. ‘The wave function must be damped in this region. Sufficiently 
far from the turning point, it has the form 





yb = ( “| äl) fi (47.1) 
EA nl | 2% Or xX > @, ‘ 


corresponding to the first term in (46.10). To the left of the turning point, 
the wave function must be represented by a real combination (46.9) of two 
quasi-classical solutions of Schrédinger’s equation: 


(i (Ef dx) + 2 if dx fi 
xp{ — x —— ex _— — : fi 
n aj? V exp( -fo ) or x<a (47.2) 


168 The Quast-Classical Case §47 


To determine the coefficients in this combination we must follow the 
variation in the wave function from positive x—a (where (47.1) holds) to 
negative x—a. In doing so, however, it is necessary to pass through a region 
near the turning point where the quasi-classical approximation is invalid, 
and the exact solution of Schrédinger’s equation must be considered. For 
small |x — a| we have 


E- U(x) x Fo(x—a), Fo = —[dU/dx]z-q < 0: (47.3) 


that is, the problem in this region is one of movement in a homogeneous 
field. The exact solution of Schrédinger’s equation for this problem has been 
found in §24, and the relation between the coefficients in (47.1) and (47.2) 
can be derived by comparison with the asymptotic forms (24.5) and (24.6) of 
this exact solution on either side of the turning point. Here it must be noted 


that (47.3) gives p(x) = yv [2mFo(x —a)], so that the integral 


a | pdx = = v (2mFo)(x— a)*!? 


is equal to the argument of the exponential in (24.5) or the sine in (24.6). 
In this discussion it is important that the region where the expansion (47.3) 
is valid and the quasi-classical region partly overlap: if the motion is quasi- 
classical in almost the whole of the field region (as we assume), then there 
exist values of |x —a| small enough for the expansion (47.3) to be valid but 
also large enough for the quasi-classicality condition to be satisfied and for 
the asymptotic forms (24.5) and (24.6) to be applicable.t+ 

There is, however, another approach that is methodologically more 
instructive and does not make use of the exact solution. For this, y(x) must 
be formally regarded as a function of a complex variable x, and the passage 
from positive to negative x—a must be along a path which is always suffi- 
ciently far from the point x = a, so that the quasi-classicality condition is 
formally satisfied along the whole path (A. Zwaan 1929), We then again 
consider values of |x —a| such that the expansion (47.3) is also valid, so that 
the wave function (47.1) has the form . 


C 


1 a 
ws 2[2m|Fol(x—a)@ Y ü zl 


Vican e-ma j (47.4) 


Let us first examine the variation of this function on passing round the 
point x = a from right to left along a semicircle of radius p in the upper half- 


t The expansion (47.3) is valid for |x—a| < L, where L is the characteristic distance for 
variation of the field U(x). The quasi-classicality condition (46.7) requires that |x—al*/? > 
h/4/(m|F,|). These two conditions are compatible, since the quasi-classicality of the motion 
far from the turning-point (i.e. for |x—a| ~ L) implies that L? > &/+/(m|F,!). 
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plane of the complex variable x. On this semicircle, 
z 
x—a = p$, | V/(x—a) dx = 3 p%/*(cos $447 sin 34), 


the phase ¢ varying from 0 to m. The exponential factor in (47.4) at first 
(for 0 < $ < $7) increases in modulus, and then decreases to modulus 1. At 
the end of the semicircle the exponent becomes purely imaginary, equal to 


-$ | Ve2miFola—s)] ae = -$ | a) de 


a 


In the coefhcient of the exponential in (47.4), the change along the semicircle 
iS 
(x == a) —1/4 -> (a me x) -1/4¢ —tn/4_ 


Thus the whole function (47.4) becomes the second term in (47.2) with 
coefficient Cz = $Ce-t/4, 

The fact that by passing through the upper half-plane it is possible to 
determine only the coefficient C2 in (47.2) has a simple explanation. If we 
follow the variation of the function (47.2) along the same semicircle in the 
opposite direction (from left to right), we see that at the beginning the first 
term rapidly becomes exponentially small in comparison wth the second 
term. But the quasi-classical approximation does not allow us to include 
exponentially small terms in 4 superimposed on the large principal term, and 
this is why the first term in (47.2) is “lost” in the passage along the semicircle. 

To determine the coefficient C1, we must pass from right to left along a 
semicircle in the lower half-plane of the complex variable x. In a similar 
manner, we find that formula (47.4) then becomes the first term in (47.2) 
with coefficient Cy = 4Ce*/4, 

Thus the wave function (47.1) for x > a corresponds to the function 


= aes cos A È s+ r) 
y h 


a 


forx < a. This rule of correspondence may be written in a form independent 
of the side of the turning-point on which the classically inaccessible region 
lies: 


Tt 


C 1 Cha ii 
ooo a a — ou! d i —> E = d | EER, ly 47. 
ap exp | Fi f? =| Ue Cos f iE x 4 } (47.5) 
for U(x) > E for U(x) < E 
(H. A. Kramers 1926). 
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Let us once again emphasize what is obvious from the proof, namely that 
this rule is associated with a particular boundary condition imposed on one 
side of the turning-point, and in this sense it can be applied only in a particular 
direction. The rule (47.5) is derived with the boundary condition that Y > 0 
into the classically inaccessible region, and must be applied to a passage from 
the latter to the classically allowed region, as is shown by the arrow.} 

If the classically accessible region is bounded (at x = a) by an infinitely 
high “potential wall’, the boundary condition for the wave function at x = a 
is yy = 0 (see §18). The quasi-classical approximation is then valid up to 
the wall itself, and the wave function 1s 


G ] 
y= — sing | pdx for x <a, 
ve (47.6) 


yy = 0 for x > a. 


§48. Bohr and Sommerfeld’s quantization rule 


States that belong to the discrete energy spectrum are quasi-classical for 
high values of the quantum number 2, the ordinal number of the state, 
since this gives the number of nodes of the eigenfunction (see §21), and the 
distance between adjacent nodes is equal in order of magnitude to the de 
Broglie wavelength. For large n this distance is small, and the wavelength is 
therefore small in comparison with the dimensions of the region of the 
motion. 

Let us derive the condition which determines the quantum energy levels 
in the quasi-classical case. To do this we consider a finite one-dimensional 
motion of a particle in a potential well; the classically accessible region 
b < x < ais bounded by two turning points.] 

According to the rule (47.5), the boundary condition at x = b gives (in the 
region right of this point) the wave function 


C if 
= EF = al 48.1 
Y 7 cos io } (48.1) 


+ A passage in the opposite direction is meaningless in that even a small change of the wave 
function on the right in (47.5) may give rise to an exponentially increasing term in the function 
on the left. 

{ In classical mechanics, a particle in such a field would execute a periodic motion with 
period (time taken in moving from x = 6 to x = a and back) 


a f 
=2Í dx/v = 2m| Gx p, 
b 


where v is the velocity of the particle. 
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Applying the same rule to the region left of the point x = a, we obtain the 
same function in the form 


C’ nc 
y= — c Li ded], 
VP j AE 


If these two expressions are the same throughout the region, the sum of their 
phases (which is a constant) must be an integral multiple of 7: 


a 


p dx- $r = nz, 


ar | — 


with C =(—1YC'. Hence 


1 
zji dx = n+} (48.2) 


where $ p dx = 2 f p dx is the integral taken over the whole period of the 


d 
classical motion of the particle. This is the condition which determines 
the stationary states of the particle in the quasi-classical case. It corresponds 
to Bohr and Sommerfeld’s quantization rule in the old quantum theory. 


The quantity Z= (1/2n)$p dx is called an adiabatic invariant (see 
Mechanics, §49); the quantization condition (48.2) can be written as 


I(E) =h(n +3). 


It has already been mentioned in §41 that, when the parameters vary 
sufficiently slowly (“adiabatically”), the system remains in the same 
quantum state; in the present case, a state with a certain value of n. 
We see that in the quasi-classical limit this statement is the same as 
the classical theorem that the adiabatic invariant is constant when the 
parameters vary slowly. 

It is easy to see that the integer n is equal to the number of zeros of the 
wave function, and hence it is the ordinal number of the stationary state. 
For the phase of the wave function (48.1) increases from —}m at x = b to 
(n+4)z at x = a, so that the cosine vanishes z times in this range (outside 
the range b < x < a, the wave function decreases monotonically and has 
no zeros at a finite distance).+ 

As has been shown previously, the number z is large in the quasi-classical 
case. It must be emphasized, however, that the retention of the term 4 added 


+ Strictly speaking, the zeros should be counted by means of the exact form of the wave 
function near the turning points. If this is done, the result given in the text is confirmed. 
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to n in (48.2) is nevertheless legitimate: to take account of the subsequent 
correction terms in the phase of the wave functions would give only terms 
~A/L on the right of (48.2), which are small in comparison with unity; see 
the remark at the end of §46.t 

In normalizing these wave functions, the integration of |x|? can be re- 
stricted to the range b < x < a, since outside this range ys decreases exponen- 
tially. Since the argument of the cosine in (48.1) is a rapidly varying function, 
we can with sufficient accuracy replace the squared cosine by its mean value 3. 


This gives 
f dx 
2 dx x 4C? | —— 
[w TF p 


= nC? 2mw = 1, 


where w = 27/T is the frequency of the classical periodic motion. Thus 
the normalized quasi-classical function 1s 


W = = Cos HE dx— 4. (48.3) 


It must be recalled that the frequency w is in general different for different 
levels, being a function of energy. 

The relation (48.2) can also be interpreted in another manner. The 
integral ¢ p dx is the area enclosed by the closed classical phase trajectory 
of the particle (i.e. the curve in the px-plane, which ts the phase space of the 
particle). Dividing this area into cells, each of area 27h, we have n cells 
altogether; n, however, is the number of states with energies not exceeding 
the given value (corresponding to the phase trajectory considered). Thus 
we can Say that, in the quasi-classical case, there corresponds to each quantum 
state a cell in phase space of area 27h. In other words, the number of states 
belonging to the volume element ApAx of phase space is 


ApAx|2nh. (48.4) 


If we introduce, instead of the momentum, the wave number k = p/h, this 
number can be written 


AkAx/2n. 


It is, as we should expect, the same as the familiar expression for the number 
of characteristic vibrations of a wave field (see Fields, §52). 


+ In some cases the exact expression for the energy levels E() (as a function of the quantum 
number n), obtained from the exact Schrédinger’s equation, is such that it retains its form as 
n—> œ; examples are the energy levels in a Coulomb field, and those of a harmonic oscillator. 
In these cases, of course, the quantization rule (48.2), although really applicable only for large 
n, gives for the function E(n) an expression which is the exact one. 
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Starting from the quantization rule (48.2), we can ascertain the general 
nature of the distribution of levels in the energy spectrum. Let AE be the 
distance between two neighbouring levels, i.e. levels whose quantum numbers 
n differ by unity. Since AE is small (for large n) compared with the energy 
itself of the levels, we can write, from (48.2), 


AE d (pE) dx = 2nh. 
But dE/@p = v, so that 
$ (ap/2E) dx = $ dxjo = T. 


Hence we have 


AE = AT = ho. (48.5) 

Thus the distance between two neighbouring levels is Aw. The frequencies 
w may be regarded as approximately the same for several adjacent levels (the 
difference in whose numbers 7 is small compared with z itself). Hence we 
reach the conclusion that, in any small range of a quasi-classical part of the 
spectrum, the levels are equidistant, at intervals of iw. This result could 
have been foreseen, since, in the quasi-classical case, the frequencies cor- 
responding to transitions between different energy levels must be integral 
multiples of the classical frequency w. 

It is of interest to investigate what the matrix elements of any physical 
quantity f become in the limit of classical mechanics. To do this, we start 
from the fact that the mean value f in any quantum state must become, in 
the limit, simply the classical value of the quantity, provided that the state 
itself gives, in the limit, a motion of the particle in a definite path. A wave 
packet (see §6) corresponds to such a state; it is obtained by superposition of 
a number of stationary states with nearly the same energy. The wave func- 
tion of such a state is of the form 


Ws 2G. 7 a 
n 


where the coefficients a, are noticeably different from zero only in some 
range An of values of the quantum number n such that 1 < An < n; the 
numbers n are supposed large, because the stationary states are quasi-classical. 
The mean value of f is, by definition, 


f = | Y'FY dx = EE anan fmn ”m, 


or, replacing the summation over n and m by a summation over n and the 
difference m —n = s, 


— * l 
f T rh ants Qnfnts, ease 


where we have put w,,, = sw in accordance with (48.5). 
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The matrix elements f,,, Calculated by means of the quasi-classical wave 
functions decrease rapidly in magnitude as the difference m—n increases, 
though at the same time they vary only slowly with z itself (mn —n being fixed). 
Hence we can write approximately 


ae a iwet — 2 tw st 
i= XE ay anfen = È |an E fe ’ 


where we have introduced the notation f, = f;,,;,7 being some mean value 
of the quantum number in the range An. But È |a,|* = 1; hence 


f 2 E faa, 


The sum obtained is in the form of an ordinary Fourier series. Since f 
must, in the limit, coincide with the classical quantity f(t), we arrive at the 
result that the matrix elements fmn in the limit become the components fm-n 
in the expansion of the classical function f(t) as a Fourier series. 

Similarly, the matrix elements for transitions between states of the con- 
tinuous spectrum become the components in the expansion of f(t) as a Fourier 
integral. Here the wave functions of the stationary states must be normalized 
by (1/h) times the delta function of energy. 

All the above results can be generalized immediately to systems with 
several degrees of freedom, executing a finite motion for which the problem 
in classical mechanics allows a complete separation of the variables in the 
Hamilton-Jacobi method (called a conditionally periodic motion; see 
Mechanics, §52). After separation of the variables for each degree of freedom, 
the problem reduces to a one-dimensional problem, and the corresponding 
quantization conditions are 


f pi dgi = 2rh(ni+ yi), (48.6) 


where the integral is taken over the period of variation of the generalized 
coordinate g;, and y; is a number of the order of unity which depends on 
the nature of the boundary conditions for the degree of freedom considered} 

In the general case of an arbitrary (not conditionally periodic) motion in 
several dimensions the formulation of the quasi-classical conditions of 
quantization calls for more far-reaching considerations. The concept of 


+ For example, in motion in a centrally symmetric field we have 


§ prdr = 2nh(r+d), $ pode = 2nh(l—m+d), $ ped = 2rAm, 


where n, = n—l—1 is the radial quantum number. The last of the three equations simply 
expresses the fact that pẹ is the z-component of the angular momentum, equal to fiz. 
t See J. B. Keller, Annals of Physics 4, 180, 1958. 
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“cells” in phase space is, however, applicable (in the quasi-classical approxi- 
mation) in the same form always. This is clear from the above-mentioned 
relationship between it and the number of characteristic vibrations of the 
wave field in a given volume of space. In the general case of a system with s 
degrees of freedom, there are 


AN = Aq. ... AgsApy ... Aps/(2ah)5 (48.7) 


quantum states in a volume element in phase space. t 


PROBLEMS 


PROBLEM 1. Determine (approximately) the number of discrete energy levels of a particle 
moving in an arbitrary (not central) field U(r) which satisfies the quasi-classica! condition. 


SOLUTION. The number of states belonging to a volume of phase space which corresponds 
to momenta in the range 0 < p < pmax and particle coordinates in the volume element dV 
is 7Pmax® dV/(27h)?. For given r the particle can have (in its classical motion) a momentum 
satisfying the condition E = p2/2m + U(r) < 0. Substituting pmax = V[— 2mU(r)], we 
obtain the tocal number of states of the discrete spectrum: 


V2 mlz 


eee ROU ee 


where the integration is over the region of space in which U < 0. This integral diverges 
(i.e. the number of states is infinite) if U decreases at infinity as r~* with s < 2, in accordance 
with the results of §18. 


PROBLEM 2. The same as Problem 1, but for a quasi-classical centrally symmetric field 
U(r) (V. L. Pokrovskii). 


SOLUTION. In a centrally symmetric field the number of states is not the same as the 
number of energy levels, on account of the degeneracy of the latter with respect to the 
direction of the angular momentum. The required number can be found by noting that the 
number of levels with a given value of the angular momentum M is the same as the number 
of (non-degenerate) levels for a one-dimensional motion in a field with potential energy 
Uet = U(r)+ M?/2mr?. The maximum possible value of the momentum pr for given r and 
energies E < 0 is pr,max = V(~2mUerr). The number of states (i.e. the required number of 


levels) is therefore 
dr dp, 2 M? 
i g ee f -1-5 ) an 
2ah 27A 2mr2 


The required total number of discrete levels is obtained from this by integration with respect 
to M! (which replaces in the quasi-classical case the summation with respect to J), and is 


(m/4h?) | (—U)r dr. 


§¢9. Quasi-classical motion in a centrally symmetric field 


In motion in a centrally symmetric field the wave function of a particle 
falls, as we know, into an angular and a radial part. Let us first consider the 
former. 

The dependence of the angular wave function on the angle ¢ (determined 
by the quantum number m) is so simple that the question of finding approxi- 





t In particular, for one particle, d®°p/(27/i)* is the number of states for a range d*p of values 
of the momentum in unit volume of coordinate space. This explains the agreement of the 
two methods of normalizing the plane wave (15.8), mentioned in the footnote to that formula. 
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mate formulae for it does not arise. The dependence on the polar angle @ is, 
according to the general rule, quasi-classical if the corresponding quantum 
number / is large (this condition will be more precisely formulated below). 

We shall here confine ourselves to deriving the quasi-classical expression 
for the angular function for the case (the most important one in applications) 
of states whose magnetic quantum number is zero (m = 0).¢ This function 
is, apart from a constant factor, the Legendre polynomial P; (cos6) (see 
(28.8)), and satisfies the differential equation 


d?P,/d62-+-cot 6 dP;/d6+/+1)P; = 0. (49.1) 
The substitution 
Pi(cos 6) = x(@)/+/sin 8 (49.2) 
reduces this to 
x” +i) +} cosec"é]y = 0, (49.3) 


which does not contain the first derivative and is similar in appearance to 
the one-dimensional Schrédinger’s equation. 
In equation (49.3), the part of the de Broglie wavelength 1s played by 


A = 2r [(1-+4)?+4 cosect#]-2/2, 


The requirement that the derivative d(A/27)/dx is small (the condition (46.6)) 
gives the inequalities 


6l>1, (r—6)l 1, (49.4) 


which are the conditions that the angular part of the wave function is quasi- 
classical. For large / these conditions hold for almost all values of 6, exclud- 
ing only a range of angles very close to 0 or v. 

When the conditions (49.4) are satisfied, we can neglect the second term 
in the brackets in (49.3) compared with the first: 


x" +(-+4)*x = 0. 
The solution of this equation is 


y = ysin P(cos@) = A sin[(l+4)0 +a], (49.5) 


where A and « are constants. 
For argles 6 < 1, we can put in equation (49.1) cos 6 = 1/6; replacing 
also (1+1) by the approximation (l+ })?, we obtain the equation 
S a aipiei 
de "g dé ai 


+ The opposite case, m = /, must correspond in the limit to motion in a classical orbit 
lying in the equatorial plane 6 = 47, since P,(cos 6) = constant Xx sin? 6, and as / -> © 
this function (and therefore | ¥!*) tends to zero for all 8 # $r. 
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which has as solution the Bessel function of zero order: 


P,(cos@) = Jo[(/+4)6], 8< 1. (49.6) 


The constant factor is put equal to unity, since we must have P, = 1 for 
6 = 0, The approximate expression (49.6) for P, is valid for all angles 
6 <1. In particular, it can be applied for angles in the range 1/] < 0 <1, 
where it must agree with the expression (49.5), which holds for all 6> 1 jl. 
For 61> 1 the Bessel function can be replaced by its asymptotic expression 
for large values of the argument, and we obtain 


a i sin[(/-+3)6+27] 
Ney 4/8 
(we can neglect 4 in the coefficient compared with /). On comparison with 


(49.5), we find that A = 4/(2/al),« = r. Thus we obtain finally the 
following expression for P,(cos 6), applicable in the quasi-classical case :T 


PSG J= i eee) 


: (49.7) 
al +/sin 6 


The normalized spherical harmonic function Yj9 is obtained from this as 
(cf. (28.8)) 


_ i? sin{(+4)0-+47] 


TT a/sin 8 i 


Let us now turn to the radial part of the wave function. It has been 
shown in §32 that the function x(7) = rR(r) satisfies an equation identical 
with the one-dimensional Schrédinger’s equation, with the potential energy 


h2 Wl-+1 
U(r) = UG) +— Saal 


r? 





Hence we can apply the results obtained in the previous sections, if the 
potential energy is understood to be the function U (r). 

The case / = 0 is the simplest. The centrifugal energy vanishes and, if 
the field U(r) satisfies the necessary condition (46.6), the radial wave 
function will be.quasi-classical in all space. For r = 0 we must have y = 0, 
and hence the quasi-classical function y(r) is determined by formulae (47.6). 

If l Æ 0, the centrifugal energy also must satisfy the condition (46.6). In 
the region of small r, where the centrifugal energy is of the same order as 
the total energy, the wavelength A = 27h/p ~ r/l, and the condition (46.6) 
gives l> 1. Thus, if Zis small, the quasi-classical condition is violated by the 


t Note that, as a result of replacing /(/+1) by (1+ 4)%, we have obtained an expression 


pr ii oa by (—1)! when @ is replaced by r — @; this is as it should be for the function 
i{cos 
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centrifugal energy in the region of small r. It is easily seen that we obtain 
the correct value of the phase of the quasi-classical wave function y(r) by 
calculating it from the formulae for one-dimensional motion, replacing the 
coefficient /(/+1) in the potential energy U (r) by (1+ 4)2:T 


K2 (14-22 
U(r) = U(r)+ an +4) A (49.9) 
2m 7r? 





The question of the applicability of the quasi-classical approximation to a 
Coulomb field U = +2/r requires special consideration. The most import- 
ant part of the whole region of the motion is that corresponding to distances 
r for which |U] ~ JE], i.e. r ~ a/|E|. The condition for quasi-classical 
motion in this region amounts to the requirement that the wavelength 
A ~h//(2m|E|) is small compared with the dimensions af/|E| of the 
region; this gives 


JE] < mo? /h?, (49 10) 


i.e. the absolute value of the energy must be small compared with the energy 
of the particle in the first Bohr orbit. This condition can also be written in 
the form 


a/v > 1, (49.11) 


where v ~ 1/(|E|/m) is the velocity of the particle. It should be noticed 
that this condition is the opposite of the condition (45.7) for the applicability 
of perturbation theory to a Coulomb field. 

The region of small distances (|U/(r)| > £) is without interest in a repulsive 
Coulomb field, since for U > E the quasi-classical wave functions diminish 
exponentially. In an attractive field, however, when Z is small it is possible 
for the particle to penetrate into the region where |U| > E, so that we have 
to consider the limits of applicability of the quasi-classical approximation in 
this case. We use the general condition (46.7), putting there 


F = —dU/dr = -ajr?, p = V(2m|U|) ~ V(ma/r). 


As a result, we find that the region of applicability of the quasi-classical 
approximation is restricted to distances such that 
r > hime, (49.12) 


i.e. distances large in comparison with the “radius” of the first Bohr orbit. 


PROBLEM 


Determine the behaviour of the wave function near the origin, if the field becomes infinite 
as + a'r’, with s > 2, when r ~ 0. 





+ For example, ın the simple case of free motion (U = 0) the phase of the function calcula- 
ted from formula (48.1) with U; from (49.9) will be the same as the phase of (33.12) for 
large r, as it should be. 
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SOLUTION. For sufficiently small r, the wavelength A ~ È y (mj Uj) mw Fir 8/2: y (ma), 
so that dà'dy ~ Art!2-1/4/(maz) <1; thus the quasi-classical condition is satisfied. In an 
attractive field U, ~ — œ when r-— 0. The region near the origin is in this case classically 
accessible, and the radial wave function y ~ 1 4/p, whence 

yy m~ r? i=l. 


In a repulsive field, the region of small r is classically inaccessible. In this case the wave 
function tends exponentially to zero as r > 0. Omitting the coefficient of the exponential 


function, we have 
f 
l 24/(2m 
ys mw exp| -> fe dr j or yb ~ expl- res , 





§50. Penetration through a potential barrier 


Let us consider the motion of a particle in a field of the type shown in 
Fig. 13, characterized by the presence of a potential barrier, i.e. a region in 
which the potential energy U(x) exceeds the total energy E of the particle. 
In classical mechanics, a potential barrier is “impenetrable” to a particle; 
in quantum mechanics, however, a particle can pass ‘‘through the barrier”: 





Fic. 13 


the probability of this is not zero. The phenomenon is also called the tunnel 
effect.t If the field U(x) satisfies the quasi-classical conditions, the trans- 
mission Coefficient for the barrier can be calculated in a general form. We 
may remark that, in particular, these conditions give the result that the barrier 
must be “wide”, and hence the transmission Coefficient is small in the quasi- 
Classical case. 

In order not to interrupt the subsequent calculations, we shall first solve 
the following problem. Let the quasi-classical wave function in the region 


to the right of the turning point x = b (where U(x) < E) have the form of a 
travelling wave: 


p= — exp: f? dx+ bin |. (50.1) 


tExamples of this type have already occurred in §25, Problems 2 and 4. 


180 The Quasi-Classical Case §50 
We require to find the wave function of this state in the region x < b. This 
can be done by the same procedure as in §47, using the plane of the complex 
variable x. Putting 


E— U(x) x Fo{x—b), Fo > 0, 


we can write the function (50.1) as 


li Sapte asa | (mF) | V(x—b) dat fin! 
(2mF o (x —b)]}1/4 hi J js 


and pass from right to left along a semicircle in the upper half-plane: 


x—b = peit, i | /(x—b) dx = 3p9/%(—sin 36 +i cos 39), 
b 


the phase ¢ varying from 0 to m. The function ẹ(x) at first decreases and then 
increases in modulus, its value at the end of the semicircle being 


b 
C 


i > 1/2 | 
p(x) = [2mF o (b — xy etma exp fal V[(2mF) / (x —5)] dx + fim i" 


x 


Thus we obtain the correspondence rulet 


C if , ë Ler 
— exp < - p dx+ tinh > exp Fall dx 
VP ti J vipl Bi i 


forx >b forx < b 








l (50.2) 


It must be emphasized that this rule presupposes a particular form of the 
wave function (a wave travelling to the right) in the classically allowed region, 
and must be applied to go from the latter to the classically inaccessible region. 

Let us now go on to calculate the coefficient for the penetration of the 
potential barrier. Let the particle be incident on the barrier from left to 
right, coming from region I. Then, in region III beyond the barrier, there 





t In a passage from right to left through the lower half-plane, the function y(x) at first 
increases and then decreases in modulus, becoming an exponentially small quantity on the 
left-hand axis (¢ - —7), which it would not be legitimate to keep superimposed on the 
exponentially large function (50.2). In the region where (x) is exponentially large, the 
inexactness of the quasi-classical approximation loses the exponentially small correction 
which for ¢ > —2 could become an exponentially large term, and the latter is therefore 


lost also. 
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will be only the wave that has passed through the barrier and is propagated 
to the right; the wave function in this region may be written 


p= J: exp (|; |p dx + tin), (50.3) 
b 


where v = p/m is the particle velocity and D the current density in the wave. 
Using the rule (50.2), we can now find the wave function in region IT, within 


the barrier: 
P exp (jf vl 
= /— exp |- | pdx 
art) il, 


D ie ae ae ae 
= ee si ante dx, }. 50.4 
ao (5 | Pas AG =) (50.4) 


Finally, applying the rule (47.5), we have in region I in front of the barrier 


b a 
ff = 2 [= exp GI |p| dx) cos Gf dx= 4r). 
v 


T 











If we put here 


D = exp - Z [le dx), (50.5) 


this becomes 


b= ER Al p dx+ 4r) 
U h 
= = exp (; | detin) + exp G i | dx— yin 
This first term (which becomes a plane wave f = e#/APt as x > — œ) 


represents a wave incident on the barrier, and the second a reflected wave. 
The normalization chosen corresponds to a unit current density in the 
incident wave, and therefore D, the current density in the transmitted wave, 
is equal to the required transmission coefficient for the barrier. Note that 


this formula is applicable only if the exponent is large, so that D itself is 
small.t 


+ The exponential smallness of D is related to the fact that the amplitudes of the incident 
and reflected waves in region I are found to be the same; the exponentially small difference 
between them is lost in the quasi-classical approximation. 
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It has been assumed in the foregoing that the field U(x) satisfies the quasi- 
classical condition over the whole extent of the barrier (excluding only the 
immediate neighbourhood of the turning points). In practice, however, we 
often have to deal with barriers where the potential energy curve on one side 
drops so steeply that the quasi-classical approximation is inapplicable. The 
exponential factor in D remains the same in this case as in formula (50.5), 
but the coefficient of the exponential (equal to unity in (50.5)) is different. 
To calculate it we must, essentially, calculate the exact wave function in the 
non-quasi-classical region and determine the quasi-classical wave function 
inside the barrier in accordance with this. 


PROBLEMS 


PROBLEM 1. Determine the transmission coefficient for the potential barrier shown in 
Fig. 14 (p. 182): U(x) = Oforx < 0, U(x) = U,—Fx for x > 0; only the exponential factor 
need be calculated. 


uix) 
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SOLUTION. A simple calculation gives the result 
44/(2m) 
D~exp(— (U —Ey at 
es alle 
PROBLEM 2. Determine the probability that a particle (with zero angular momentum) will 
emerge from a centrally symmetric potential well with U(r) = —U, for r < rq, U(r) = ajr 
for r > ro (Fig. 15).t 
U(r) 
Fo z 
“Up 
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SOLUTION. The centrally symmetric problem reduces to a one-dimensional one, so that 
the formulae obtained above can be applied. We have 


+ This problem was first discussed by G. Gamow (1928) and by R. W. Gurney and 
E. U. Condon (1929) in connection with the theory of radioactive a-decay. 


§50 Penetration through a potential barrier 183 


s~est- T JEC) 


Evaluating the integral, we finally obtain 


se A a A 


In the limiting case rẹ —> 0, this formula becomes 


u mw gs IAV im 1E) = eT 2na {Av 


These formulae are applicable when the exponent is large, i.e. when a/v > 1. This condi- 


tion agrees, as it should, with the condition (49.11) for quasi-classical motion in a Coulomb 
field. 


ProsLeM 3. The field U(x) consists of two symmetrical potential wells (I and II in Fig. 
16), separated by a barrier. If the barrier were impenetrable to a particle, there would be energy 
levels corresponding to the motion of the particle in one or other well, the same for both 
wells. The fact that a passage through the barrier is possible results in a splitting of each of 
these levels into two neighbouring ones, corresponding to states in which the particle moves 
simultaneously in both wells. Determine the magnitude of the splitting (the field U(x) is 
supposed quasi-classica)). 
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SOLUTION. An approximate solution of Schrédinger’s equation in the field U(x), neglecting 
the probability of passage through the barrier, can be constructed with the quasi-classical 
wave function ¢o(x) which describes the motion with a certain energy E, in one well, say I, 
i.e. which is exponentially damped on both sides of this well; the function f(x) is assumed to 
be norm" ‘zed so that the integral of p? over well I is unity. When the small probability of 
tunnelling is taken into account, the level E, splits into levels E, and E,. The correct zero- 


approximation wave functions corresponding to these levels are the symmetric and anti- 
symmetric combinations of p(x) and ¥%,(—-x): 


y(x) = Ios) oun): 
š (1) 
Ja(x) = alol) -vol = =) 
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In well I, the function ¥o(—x) 1s vanishingly small in comparison with p(x); in well II the 
opposite is true. The product yYo(x)%o(—x) is therefore vanishingly small everywhere, and 
the functions (1) are normalized so that the integrals of their squares over wells I and IJ 
are unity. 

Schrédinger’s equations are 


Yo +(2mi/h®)(Ey—U)d, = 0, by + (2mih?)(E, — U = 0; 


we multiply the former by y, and the latter by Yo, subtract corresponding terms, and integrate 
over x from 0 to œ. Bearing in mind that, for x = 0, p, = V2. and ¢,’ = 0, and that 


(vb dx x 5 {we dx = I/,/2, 


we find 
E,—E, = —(F?/m)yo( Oyo (0). 


Similarly, we find for E,—E, the same expression with the sign changed. Thus 
E,—E, = (25° / m) (0a (0). 


By means of formula (47.1), with the coefficient C from (48.3), we find that 





Zrv 


y0) = / > a[i fi dx |, 4,'(0) = 0), 
G 


where vp = V/[2(U,—E,)/m]. Thus 


wh if 
E,—E, = “epf —; flo ax. 


where a is the turning point corresponding to the energy E,; see Fig. 16. 
PROBLEM 4. Determine the exact value of the transmission coefficient D for the passage 


of a particle through a parabolic potential barrier U(x) = —4kx® (supposing that D is not 
small) (E. C. Kemble 1935).+ 


SOLUTION. Whatever the values of k and E, the motion is quasi-classical at sufficiently 
large distances |x|, with 


p = VJ [2m(E4dhx*)] = x4/(mk)+ Ey/(m/k)/x, 
and the asymptotic form of the solutions of Schrédinger’s equation is 


z - 
yy = constant xetté /2gste—1/2, 


where we have introduced the notation 
E = x(mkjht ls, © = (Elh) (m]k) 


We are interested in the solution which, as x > +00, contains only a wave which has 
passed the barrier, 1.e. is propagated from left to right. We put 


as x> Œ, yy = Bere 2g 1/2 (1) 
asx -—-C&, wy = g-i" — £) feo 24 Aet 2 — Ejie-3 2 (2) 


+ The solution of this problem can also be applied to penetration sufficiently near the top 
of any barrier U(x) whose dependence on x near the maximum ts quadratic. 
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In the expression (2), the first term represents the incident wave, and the second the reflected 
wave (the direction of propagation of a wave is that in which its phase increases). The 
relation between A and B can be found by using the fact that in this case the asymptotic 
expression for y is valid in the whole of a sufficiently distant region of the plane of the complex 
variable ¢. Let us follow the variation of the function (1) as we go round a semicircle of large 
radius p in the upper half-plane! of £: 


E = peta? = p?( —sin 2¢ + i cos 2¢), 


with ¢ varying from 0 to r. As a result of traversing this semicircle, the function (1) becomes 
the second term in (2), with coefficient 


A= Bein jie- 2 $ —1B~-7«; (3) 


in the part of the path (47 < ¢ < r) where the modulus |e**"!?} is exponentially large, the 
exponentially small quantity which should give the first term in (2) is lost.t 


With the normalization of the incident wave chosen in (2), the condition of conservation of 
number of particles is 


|Al?+|B)? = 1. (4) 
From (3) and (4) we find the required transmission coefficient: 


D = |B|? = 1/(1 +e-2"9). 
This formula holds for any E. If the energy ıs large and negative, it gives D = e—?"!! 
in accordance with formula (50.5). For E > 0. the quantity 


R=1-D = 1/1 +e"$) 


is the coefficient of reflection above the barrier. 


§51. Calculation of the quasi-classical matrix elements 


A direct calculation of the matrix elements of any physical quantity f with 
respect to the quasi-classical wave functions presents great difficulty. We may 
suppose that the energies of the states between which the matrix element is 
calculated are not close to each other, so that the element does not reduce to 
the Fourier component of the quantity f (§48). The difficulties arise because, 
owing to the fact that the wave functions are exponential (with a large imagin- 
ary exponent), the integrand oscillates rapidly. 

We shall consider a one-dimensional case (motion in a field U(x)), and sup- 
pose for simplicity that the operator of the physical quantity is merely a func- 
tion f (x) of the coordinate. Let p and y be the wave functions correspond- 
ing to some values E, and ŒE, of the energy of the particle (with EF, > E, 


Fig. 17); we shall suppose that y, and ws, are taken real. We have to calculate 
the integral 


t The passage through the lower half-plane to determine A would be unsuitable, since 
on the part of the path ee < ¢ < —47) that adjoins its left-hand end (where ¢ is given 
by (2)), the term in etë? is exponentially small in comparison with e~-#?/2, 
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fia = | tafe dx. (51.1) 





ĉi i 
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According to (47.5), the wave function y1 in the regions on both sides of 


the turning-point x = a, but not in its immediate neighbourhood, is of the 
form 


for x <a); hee | -; [ee] 


(51.2) 


for x > a, h = cos ab dr), 
Pi 


and similarly for y, (replacing the suffix 1 by 2). 

However, the calculation of the integral (51.1) by substituting in it these 
asymptotic expressions for the wave functions would not give the correct 
result. The reason is, as we shall see below, that this integral is an exponen- 
tially small quantity, whereas the integrand is not itself small. Hence even a 
relatively small change in the integrand will in general change the order of 
magnitude of the integral. This difficulty can be circumvented as follows. 

We represent the function y, as a sum pe = Yat +437, expressing the cosine 


(in the region x > ag) as the sum of two exponentials. According to (50.2), we 
have 


for x < an pt = -e apf [ns all 


spools [mee] 


(51.3) 





for x > ap, Yt = 
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the function y7 is the complex conjugate of pat : p = (gt). 

The integral (51.1) is also divided into the sum of two complex conjugate 
integrals fig = fiet-+fie-, which we shall proceed to calculate. First of all, 
we note that the integral 


f -Í pf yt dx 


converges. For, although the function ¢e+ increases exponentially in 
the region x < aş the function 4}, in the region x < a,, tends exponentially 
to zero still more rapidly (since we have |p,| > |p,| everywhere in the region 
x < Gp). 

We shall regard the coordinate x as a complex variable, and displace the 
path of integration off the real axis into the upper half-plane. When x 
receives a positive imaginary increment, an increasing term appears in the 
function 4 (in the region x > a), but the function %,* decreases still more 
rapidly, since we have p, > p, everywhere in the region x > a. Hence the 
integrand decreases. 

The displaced path of integration does not pass through the points x = a,, 
a, on the real axis, near which the quasi-classical approximation is inapplic- 
able. Hence we can use for ¥, and ypt, over the whole path, the functions 
which are their asymptotic expressions in the upper half-plane. These are 


oF ir 
yy, = al § | vemu—z)) a | 
i (51.4) 
bate ne 


mae ae pe, 


where the roots are taken so as to be positive on the real axis for x < aj, ae. 
In the integral 


p —iC,C> 


a= 44/(2m) 





Í exp| = f /{2m(U—E,)} de f J/{2m(U—E,)} ax | x 


Aa) ae 
[(U—E\(U—E,)p 


we desire to displace the path of integration in such a way that the exponential 
factor is diminished as much as possible. The exponent has an extreme value 
only where U(x) = œ (for E, # E, its derivative with respect to x vanishes 
at no other point). Hence the displacement of the contour of integration into 
the upper half-plane is restricted only by the necessity of passing round the 
singular points of the function U(x); according to the general theory of linear 
differential equations, these coincide with the singular points of the wave 
function ¥(x). The actual choice of the contour depends on the actual form 


(51.5) 
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of the field U(x). Thus, if the function U(x) has only one singular point 
x = xo 1n the upper half-plane, the integration can be effected along the type 
of path shown in Fig. 18. The immediate neighbourhood of the singular 
point plays the important part in the integral, so that the matrix element 


fie = 2 re fig+ required is practically proportional to an exponentially small 
expression of the form 


fie ~ exp ~ simf f vene- U)] dx— f V [2m(E1— U)] ax || (51.6) 


(L. D. Landau 1932). 


Xo 
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The lower limits of the integrals may be any points in the classically 
accessible regions; their particular values evidently do not affect the imaginary 
parts of the integrals. If the function U(x) has several singular points in the 
upper half-plane, xo in (51.6) must be taken as that for which the exponent is 
smallest in absolute value.t 

Formula (51.6) becomes simpler when the energies E, and =Æ, are 
almost the same, so that the matrix element reduces, according to the 
results in §48, to the Fourier time component of the classical quantity 
J [x (¢)]. Putting E, = E $h, and expanding in powers of hœ, we 
find 


fia~ exp -omnim | |=" dx) = apon im) (51.6a) 


The quantity 





ft In deriving formulae (51.5) and (51.6), we have replaced the wave functions by their 
asymptotic expressions, since, in the integral taken along the contour shown in Fig. 18 (p. 188), 
the order of magnitude of the integral is determined by that of the integrand; hencea relatively 
small change in the latter does not have any great effect on the value of the integral. 

fî We assume that the quantity f(x) itself has no singular points. 

The estimate (51.6) for the matrix element presupposes a “normal” order of magnitude for the 
coefficient of the exponential. There can of course be cases where the nature of the problem makes 
this coefficient unusually small. The simplest example is when / (x) = constant. The matrix element 
is then zero because the wave functions are orthogonal; this is not shown by (51.6). 
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x 


T -| ge = {axe (x) 


can be regarded as the complex time at which the particle reaches the point 
x) in the complex x-plane. The quantity 


Ae fe — U &)) 


m 


is the corresponding complex velocity. It is easily seen that (51.6a) in fact 
gives the approximate expression for the Fourier component of f [x (¢)] 
if œa imt > l. 

The quasi-classical matrix elements for motion in a centrally symmetric 
field must be calculated by the same method. However, we must now replace 
U(r) by the effective potential energy (the sum of the potential energy and 
the centrifugal energy), which will be different for states with different Z. 
In view of further applications of the method in question, we shall write the 
effective potential energies in the two states in a general form, as Uj(r) and 
Ue(r). Then the exponent in the exponential factor in the integrand in (51.5) 
has an extreme value not only at the points where Uj(r) or U(r) becomes 
infinite, but also at those where 

U,(r)— U(r) = ELE). (51.7) 
Hence, in the formula 


fe~ exp| — - im | f EE U2)] dr— Í E U1)] ar |} (51.8) 


the possible values of rẹ include not only the singular points of U,(r) and 
U(r), but also the roots of equation (51.7). 

The centrally symmetric case differs also in that the integration over r in 
(51.1) is taken from 0 (and not from — œ) to œ: 


fie = f Xıf X2 dr. 


Here two cases must be distinguished. If the integrand is an even function 
of r, the integration can be formally extended to the whole range from — œ 
to co, so that there is no difference from the previous case. This may occur 
if U,(r) and U,(r) are even functions of r [U(—r) = U(r)]. Then the wave 
functions y,(r) and y(r) are either even or odd functionst (see §21), and, 
if the function f(r) is also even or odd, the product y, fy, may be even. 

_ If, on the other hand, the integrand is not even (as always happens if U(r) 
1s not even), the start of the path of integration cannot be moved away from 


the point r = 0, and this point must be included among the possible values of 
rg in (51.8). 


_ t For even U(r), the radial wave function R(r) is even (or odd) when / is even (or odd), as 
is seen from its behaviour for small r (where R ~ ri). 
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PROBLEMS 


PROBLEM 1. Calculate the quasi-classical matrix elements (exponential factor only) in a 
field U = U,e7°*. 


SoLuTION. U(x) becomes infinite only for x > —œ. Accordingly, we put x,» = — œ% 
in (51.6). We can extend the integration to + ©. 

Each of the integrals diverges at the lower limit. Hence we first calculate them from —x 
to ©, and then pass to the limit x ~ œ. We find 


ii a~ g trmiahivm t), 


where vi = y (2Eı/m), ve = y/(2E2/m) are the velocities of the particle at infinity (x > œ), 
where the motion is free. 


PROBLEM 2. The same as Problem 1, but in a Coulomb field U = a/r, for transitions be- 
tween states with / = 0. 


SOLUTION. The only singular point of the function U(r) is r = 0. The corresponding 
integral has been calculated in §50, Problem 2. As a result we have by formula (51.8) 


mala 


ProsLeĮm 3. The same as Problem 1, but for an anharmonic oscillator with potential energy 
U (x) =}mm?x? + Bx‘, under the condition 


hw <E,, E, < mo |p. (1) 


Sotution. The generalization of the analysis given in the text to the case of finite motion shows 
that (51.6) remains valid. As x) we must take the points x — + 0. both of which give contributions 
of the same order. Then 


= -E 
faxo =; | J [2m(U — E,)] dx — | V[2m(U — E,)] a |) 


With the condition (1), the main contribution comes from the range 
V(Elmo’), sf (Ea men”) < |x] < / (mo?i), (2) 
in which 
mw’? > Ep Ey, Bx’. 


Expanding the exponent in powers of £, ,/U (the zero-order terms cancel) and neglecting Bx’, 


we have 
f E, LA E, { atl 
¿~ exp —— | —+— i 
ý EV ho lx] hæ f| |x| 


The logarithmically divergent integrals are to be cut off at the ends of the range (2): x ~ / (men? /B) 
above and x ~ a, ~ y [E |m% j, x ~ a, ~ y [E mw’) below. The result is 











E. mot E m’wt 
f oe 2 i 
i2 > : 


E E E vee 
aho S BE: She PE; 


With the state numbers n, z (E,/h@), n, % (E,/hw}, we can write this as 


f ny :2 Bh na — Mi 2 
2 ; 
ne? \ mw? 
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Since large values of x are important in the solution, the result is valid if f(x) does not increase too 
rapidly at infinity. If f(x) is a polynomial, its degree must be much less than n, — n. 


§52. The transition probability in the quasi-classical case 


Penetration through a potential barrier is an example of a process which 
is entirely impossible in classical mechanics. In the quasi-classical case the 
probability of such processes is exponentially small. The relevant exponent 
can be determined as follows. 

Considering a transition of any system from one state to another, we 
solve the corresponding classical equations of motion and find the “‘path” of 
the transition; this, however, is complex, in accordance with the fact that the 
process cannot occur in classical mechanics. In particular, it is found that, 
in general, the ‘‘transition point” go at which the formal transition of the 
system from one state to the other occurs is complex; the position of this 
point is determined by the classical conservation laws. We next calculate 
the action Sj(q1, go)+S2(go, q2) for the motion of the system in the first 
state from some initial position qı to the “transition point” go, and then in the 
second state from go to the final position gz. The required probability of the 
process is then given by the formula 


2 
w ~ exp |- "i [S1 (91:90) + Se (go). (52.1) 


If the position of the “transition point” is not unique, it must be chosen 
so that the exponent in (52.1) has the smallest absolute value (which must yet, 
of course, be sufficiently large for formula (52.1) to be valid). 

Formula (52.1) is tn accordance with the rule derived in §51 for calculating 
the quasi-classical matrix elements. It should be emphasized, however, that 
it would not be correct to use the square of the matrix element in calculating 
the coefficient before the exponential in the probability of such transitions. 

The method of complex classical paths based on (52.1) is a general one, 
applicable to transitions in systems with any number of degrees of freedom 
(L. D. Landau 1932). If the transition point is real, but lies in the classically 
inaccessible region, then (in the simple case of one-dimensional motion) 
formula (52.1) is the same as (50.5) for the probability of penetration through 
the potential barrier. 


REFLECTION ABOVE THE BARRIER 


Let us apply (52.1) to the one-dimensional problem of reflection above the 
barrier, i.e. reflection of a particle whose energy exceeds the height of the 
barrier. In this case, go is to be taken as the complex coordinate xo of the 
“turning point” at which the particle reverses its direction of motion, i.e. 
the complex root of the equation U(x) = E. We shall show how the reflection 





t If the potential energy of the system has itself singular points, these also must be con- 
sidered as possible values of gp. 
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coefficient may then be calculated more precisely, including the coefficient 
of the exponential. 

We must again (as in §50) establish the relation between the wave functions 
far to the right of the barrier (the transmitted wave) and far to the left (the 
incident and reflected waves). This is easily done by a method similar to 
that used in §§47 and 50, regarding 4 as a function of the complex variable x. 

We write the transmitted wave in the form 


l : T 
TE —exp(= | pdx), 
VP h 
Tı 


where xı is any point on the real axis, and follow its variation on passing along 
a path C in the upper half-plane which encloses (at a sufficient distance) the 
turning point xo (Fig. 19); the whole of the latter part of this path must 
lie so far to the left that the error in the approximate (quasi-classical) wave 
function of the incident wave is less than the required small quantity _. 
Passage round the point xo causes a change in the sign of the root 4/[E — U(x)], 
and after the return to the real axis the function %, therefore becomes 
yy_, a wave propagated to the left (1.e. the reflected wave).t Since the ampli- 
tudes of the incident and transmitted waves may be regarded as equal, the 


c 


Fic. 19 


required reflection coefficient R is simply the ratio of the squared moduli 


of y- and p=: 


2 
R= — = exp(—=im | pax). (52.2) 
(i 


Having derived this formula, we can deform the path of integration in the 
exponent in any manner; if we convert it into the path C’ shown in Fig. 19, 
the integral reduces to twice the integral from x; to xo, giving 


R = exp ( —4o(x1, xo)/ħ), o(%1, x0) = im | p(x) dx; (52.3) 


+ A passage along a path below the point x, (simply going along the real axis, for example) 
converts the function ¢_ into the incident wave. 
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since pix) is real everywhere on the real axis, the choice of x; is 
immaterial.t Note that the coefficient of the exponential in (52.3) is unity 
(V. L. Pokrovskii, S. K. Savvinykh and F. R. Ulinich 1958). 

As already mentioned, among the possible values of x) we must select the 
one for which the exponent in (52.3) is smallest in absolute magnitude (and 
this value must be large compared with umity).|] It ts also implied that, 
if the potential energy U (x) itself has singularities in the upper half-plane, 
the integral o(x,,%») has larger values for such points§; otherwise the 
exponent would be determined by one of these points, but the coefficient 
of the exponenual would not be unity as in (52.3). This condition is 
certainly not satisfied with increasing energy E if U(x) becomes infinite 
anywhere in the upper half-plane: ultimately the point x, at which U = E 
becomes so close to the point x, where U = œ that the two points give 
comparable contributions to the reflection coefficient (the integral 
Oix, Xo) ~ 1), and formula (52.3) becomes invalid. In the limit where E 
is so large that this integral is small compared with unity, perturbation 
theory becomes applicable (see Problem 2). Tf 


PROBLEMS 

PROBLEM 1. Using the quasi-classical approximation, with exponential accuracy, determine 
the probability of disintegration of a deuteron in collision with a heavy nucleus regarded as 
the fixed centre of a Coulomb field (E. M. Lifshitz 1939). 

SoLuTion. The principal contribution to the reaction probability comes from collisions 
with zero orbital angular momentum. In the quasi-classical approximation these are the 
head-on collisions, 1n which the movement of the particles becomes one-dimensional. 

Let E be the deuteron energy ın units of <, the binding energy of the proton and the neutron 
in the deuteron; En and Ep the energies of the released neutron and proton in the same units. 
We shall also use the dimensionless coordinate q = er! Ze? (where Ze is the charge on the 
nucleus), and denote by go its value (which is in general complex) at the “transition point”, 
i.e. at the “moment of disintegration” of the deuteron. We can write 


En = bon, Ey = tea, E = pate (1) 
qo go 


here wn, tp and va are the velocities of the particles at the moment of disintegration, in units of 
\'(e/m), where m is the nucleon mass; t'n is real and is the same as the velocity of the released 
neutron, but vp and va are complex. The conditions for the conservation of energy and 
momentum at the transition point give 


t In some cases, not only the amplitude relations but also the phase relauons between the incident 
and reflected waves are of interest. These are described by the reflection amplitude, expressed in 
terms of the coefficients x and f §25,. l is easily shown by the above arguments that, in parucular, 
the reflecuon amplitude for a wave incident from the tefi is 


2 | 
BY in* = ~i ool 5 (f: dx son) y> —-XK. 


Phe factor — i is due to the change in the phase of the coefficient of the exponential in passing round 
the branch point (§47 >. 

$ lhe proof given here is due to L. D. Landau (1961). 

| Of course. oniy points x, are considered for which o > G, i.e. points lying in the upper half-plane. 

§ The contour © in Fig. 19 must pass below the singularities of L'/x). 

tł An intermediate case is discussed by V. L. Pokrovskii and I. M. Khaiamikov, Soviet Physics 
JETP 13, 1207, 1961. , 
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Ep+ En = E-1,  tytin = 2ra, (2) 
whence 
3 : H . 
Up = ittm ta = 14+%a, — = E41 ~¢,24 2iep. 
g0 


The action of the system before the transition corresponds to the motion of the deuteron 
in the field of the nucleus up to the ‘point of disintegration; its imaginary part is 


Ze? E mf ‘| | 4(E- 


ze | im 2a ~ = cosh *vigoe)} (3) 


im Śı 


tt 


After the transition, the action corresponds to the motion of the neutron and the proton 
away from the point of disintegration: 


fem [aa [Ae] 


2 
Ze? J im ‘a Tago — “pgo + J Tu VE») (4) 


tt 


im So 


According to (52.1), the probability of the process is 





9 t 
w ~ exp F £ [rie Re cosh -!4/(g9E;) -5 cosh ~1 v(goE) |} (5) 


In accordance with the fact that the two inverse hyperbolic cosines here come from (4) and 
(3), the signs of their imaginary parts must be the same as those of im vp and im va respectively, 
and the signs of the latter in the solution of equations (2) are chosen so as to make 
im(S;+.S8,) > 0. 

Because w depends exponentially on En, the total probability of disintegration (with any 
values of En and Ep = E—1 —En) is given by the minimum absolute value of the exponent 
as a function of En. Analysis shows that this occurs when En > 0. Then g, = 1/(E+1), and 
from (5) we find 


tae Pn); ee ee E 
wn exp “I le l = —— COS l — i 
; R-i Eel fF E+) 


The condition for this formula to be valid is that the exponent should be large compared 
with unity. 

Having calculated the imaginary part of the action S = S, + S, for non-zero values of 
En, we can find the energy distribution of the particles released. Near En = 0, we havet 





dim § 





im S(En) — im S(0) = E, 0 


A calculation of the derivative gives 


cts ~ ce P CHENE. SEFER cos ml) 
dEn exp f- sE [eren I(E + Ta V(2(£ — 17] E+1 


PROBLEM 2. Determine the coefficient of reflection above the barrier for particle energies 
such that perturbation theory is applicable. 





+ When Er = 0, the function ım S(En) has a cusp from which it increases for both positive 
and negative Ex (the negative values corresponding to the capture of the neutron by the 
nucieus). 
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SOLUTION, Formula (43.1) is used, the initial and final wave functions being plane waves 
propagated in opposite directions and normalized respectively by unit current density and the 
delta function of momentum divided by 27h, with dv = dp'/2rħ and p’ the momentum after 
reflection. Carrying out the integration with respect to p’ (taking account of the delta function), 
we obtain 


m2 


Ra kan { U(x)e?!Pz/A dxi? (1) 





A2p2 


This formula is valid if the conditions for perturbation theory to be applicable are satisfied: 
Uajhv <1, where a is the width of the barrier (see the third footnote to §45), and also 
pajh s 1. The latter condition ensures that the function R(p) is not exponential; otherwise 
the question of the validity of formula (1) would require further investigation. 


PROBLEM 3. Determine the coefficient of reflection above the barrier for a quasi-classical 
barrier when the function U(x) has a discontinuity of slope at x = Xp. 


SOLUTION. If the function U(x) has a singularity for real x, the reflection coefficient is 
determined mainly by the field near that point, and perturbation theory can be formally 
applied to calculate it, without having to be valid for allx; the fulfilment of the quast-classical 
condition is sufficient. We then have formula (1) of Problem 2, the only difference being that 
the momentum of the incident particle must be replaced by the value of p(x) at the singular 
point. 


In this case we take the point of discontinuous slope as x = 0, and thus have near this point 
U = —F,x for x> 0, U = —Fox for x <0, 


with different Fı and F2. The integration with respect to x is effected by including in the 
integrand a damping factor e+42 and then letting A > 0. The result ts 


m2 


he 
R= Fo — F,)*, 
Tepe 2— Fi) 


where po = p(0). 


§53. Transitions under the action of adiabatic perturbations 


It has already been mentioned in §41 that, in the limit of a perturbation 
which varies arbitrarily slowly with time, the probability of a transition of 
a system from one state to another tends to zero. Let us now consider this 
problem quantitatively, by calculating the transition probability under the 
action of a slowly varying (adiabatic) perturbation (L. D. Landau 1961). 

Let the Hamiltonian of the system be a slowly varying function of time, 
tending to definite limits as £ -> + œ, and let yn(qg, t) and E,,(t) be the eigen- 
functions and the eigenvalues of the energy (depending on time as a para- 
meter) obtained by solving Schrédinger’s equation A(t), = Enn; on 
account of the adiabatic variation of H with time, the time variation of En 
and %, with time will also be slow. The problem is to determine the proba- 
bility 21 of finding the system in a certain state #2 ast > + œ, if it was in 
the state W; as t > — o: 

The slow variation of the perturbation means that the duration of the 
“transition process” is very long, and therefore the change in the action during 
this time (given by the integral — f E(z) dt) is large. In this sense the problem 


is quasi-classical, and the required probability is mainly determined by the 
values zo of z for which 
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Ey(to) = Ez(to) (53.1) 


and which correspond, as it were, to the “instant of transition” in classical 
mechanics (cf. §52); in reality, of course, such a transition is classically 
impossible, as 1s shown by the fact that the roots of equation (53.1) are 
complex. It 1s therefore necessary to examine the properties of the xslutions 
of Schrédinger’s equation for complex values of the parameter f in the 
neighbourhood of the point t = to at which the two eigenvalues of the energy 
become equal. 

As we shall see, the eigenfunctions %1, %2 vary rapidly with ¢ near this 
point. To determine this dependence, we first define linear combinations 
di, $2 of Yi, Ye which satisfy the conditions 


| $2 dg = foe dg = 0, f dade dg = 1. (53.2) 


This can always be achieved by suitable choice of the complex coefficients 
(which are functions of t). The functions ¢), $2 have no singularity at t = fo. 
We now seek the eigenfunctions as linear combinations 


yb = adit argo. (53.3) 


Here ıt must be borne in mind that, when the “time”’ tis complex, the operator 
H(t) (of the form (17.4)) is still equal to its transpose (H = Ê), but is no 
longer Hermitian (H # H*), since the potential energy U(t) # U*(t). 

We substitute (53.3) in Schrédinger’s equation, multiply on the left by 
di Or do, and integrate with respect to q. With the notation 


F7;;(t) = Í difer da, (53.4) 


and using the fact that Hio = H2) owing to the above-mentioned property 
of the Hamiltonian, we obtain the equations 


Aya, + Higa = Eao, 
ae \ (53.5) 


H281 + Heea2 = Ea). 


The condition for these equations to have non-zero solutions is (Hi2- E} = 
H H?292, and the roots of this give the energy eigenvalues 


E = Hyt (Hu). (53.6) 
Then (53.5) gives 
ajay = + (Ai1/H22). (53.7) 


It is seen from (53.6) that, for a coincidence at the point £ = £o of the two 
eigenvalues, either H11 or Hee must vanish at that point; let Mia vanish there. 
At a regular point, a function in general vanishes as ¢ — fo, and therefore 
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E(t)— E(to) = + constant x +/(t—fo), (53.8) 


i.e. E(t) has a branch point at £ = to. We also have as ~ 4/(t— to), and so 
there is at the point ż = fo only one eigenfunction, ¢). 

We now see that the problem is formally completely analogous to the 
problem of reflection above the barrier discussed in §52. We have a wave 
function ¥(ż) which is ‘‘quasi-classical with respect to time’, instead of the 
function quasi-classical with respect to the coordinate in §52, and wish to 
find the term of the form cəpoe™tE.t A in the wave function for t + + œ, 
if the wave function Y(t) = fue t#,"’" as t > — œ. This is analogous to the 
problem of determining the reflected wave for x > — œ from the transmitted 
wave for x > + œ. The required transition probability wo, = |co|?. The 
action S = — f E(t) dt is given by the time integral of a function having 
complex branch points (just as the function p(x) in the integral f p dx had 
complex branch points). The problem under consideration is therefore 
dealt with by means of a contour in the plane of the complex variable z 
from large negative to large positive values, just as in §52 for the plane of the 
variable x, and we shall not repeat the derivation here. 

We shall suppose that Ey > E on the real axis. Then the contour must lie 
in the upper half-plane of the complex variable £ (where the ratio 
e iE ,t/hle-tE t/h increases). The resulting formula (analogous to (52.2)) is 


W2) = exp( simf E(t) ar), (53.9) 


Cc 


where the integration is along the contour shown in Fig. 19 (from left to 
right). 

On the left-hand branch of this contour E = E), and on the right-hand 
branch E = E. We can therefore write (53.9) in the form 


ty 


wo, = exp( —2im f wo(t) dr), (53.10) 


fi 


where we) = (E2— £)/h, and ty is any point on the real axis of t; to must be 
taken as that root of equation (53.1) lying in the upper half-plane for which 
the exponent in (53.10) is smallest in absolute value.t In addition, besides 
the direct transition from state 1 to state 2, there may be possible paths 
through various intermediate states; the probabilities of these are given by 
analogous formulae. For example, for a transition 1 > 3 -> 2 the integral 
in (53.10) is replaced by a sum of integrals: 


tC B 


f wa)(t) d? + | we3(t) dz, 


+ The possible values of to must include any points at which E(t) becomes infinite; for 
such points the coefficient of the exponential in (53.10) will not be unity. 
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where the upper limits are the “points of intersection” of the terms F(t), E(t) 
and E,(t), E(t) respectively. This result is obtained by means of a contour which 
encloses both these complex points. 


PROBLEM 


Determine the change in the adiabatic invariant for a classical oscillator which obeys the equation 
d?x /dt? + w?(é)x =0, (1) 


when the frequency w (t) varies slowly from its value œ, at £ > — 00 to œw, at £ > 90 (A. M. Dykhne 
1960). 


SoLuTion. Equation (1) is derived from Schrédinger’s equation by a change of notation: 


Wy x, xb, p(x)h =k(x) > w(t); 


the problem is then formally equivalent to that of reflection from a potential barrier (§29). This 
allows the calculation of the change in the adiabatic invariant to be reduced to that of the reflection 
amplitude. 

We write the solution of (1) for £ + +0 as 


x= A, + Abe OV) x -—00, 


x = Ave 4 Ake, x +00. 
From (25.6), 
A, = aA, + PAF. (2) 


The adiabatic invariant of the oscillator is E/@, so that 
1, = mw, x? =2mw,)A\|?, L = 2maw,|A,|?. 
or, substituting (2), 
l = 2mo; [(|a]? + |B IPL Ail? + 2 re(x8*47)]. 
With the relation (25.7), which in the present notation is |«|? = |$ |? + @,/w,, we have 
I, — 1, = 4mm,[|B |?| Ay]? + re(aB*A7)]. (3) 


The case where w(t) varies slowly corresponds, in the barrier reflection problem, to the 
quasi-classical case in §52. Then $ is exponentially small, and |a |? + w,/@,. (It is assumed that w? (£) 
has no singularities or zeros on the real f-axis.) The procedure described in §52 for calculating the 
reflection amplitude gives the estimate 


to 


Al=1,-1,~|B| ~op —2im fwar) 


fy 


where ¢, is the singular point in the upper half-plane of £ that gives the greatest contribution to AZ. 
This formula coincides with the results in Mechanics, §51, for the harmonic osciliator case considered. 
When w? (t) has a simple zero in the upper half-plane, the formulae in §52 allow the coefficient of 
the exponential to be found also; see the footnote on p. 193. 

The second (and principal) term in (3) depends on the initial phase of the oscillations. It becomes 
zero when averaged over that phase, so that 


Al = 2Ri,, 
where R > (w,/a,)|B|? is the “reflection coefficient”. 


+ The intermediate states of a continuous spectrum require a special discussion. 


CHAPTER VIII 


SPIN 


§54. Spin 

IN BOTH classical and quantum mechanics, the law of conservation of angular 
momentum 1s a consequence of the isotropy of space with respect to a closed 
system. ‘his already demonstrates the relation between the angular momen- 
tum and the symmetry properties under rotation. In quantum mechanics, 
however, the relation in question 1s a particularly far-reaching one, and 
essentially constitutes the basic content of the concept of angular momentum, 
especially as the classical definition of the angular momentum of a particle 
as the productr x p has no direct significance in quantum mechanics, owing 
to the fact that position and momentum cannot be simultaneously measured. 

We have seen in §28 that, if the values of / and m are specified, the angular 
dependence of the wave function of the particle is determined, and therefore 
so are all its symmetry properties under rotation. The most general formula- 
tion of these properties involves specifying the transformation of the wave 
functions when the coordinate system is rotated. 

The wave function #z37 of a system of particles (with specified values of 
the angular momentum L and its component M) remains unchanged} only 
in a rotation of the coordinate system about the z-axis. Any rotation that 
alters the direction of this axis has the result that the z-component of the 
angular momentum does not have a definite value. This means that, in the 
new coordinates, the wave function in general becomes a superposition (a 
linear combination) of 2L+1 functions corresponding to the different 
possible values of M for the given L. We can say that the 2L + 1 functions 
Yrm are transformed into linear combinations of one another when the 
coordinate system is rotated.[ The law governing this transformation (i.e. the 
coefficients in the superposition as functions of the angles of rotation of the 
coordinate axes) is entirely determined by specifying the value of L. Thus 
the angular momentum acquires the significance of a quantum number 
which classifies the states of the system according to their transformation 
properties under rotation of the coordinate system. This aspect of the 
concept of angular momentum in quantum mechanics is particularly 
important because it is not directly related to the explicit angular dependence 
of the wave functions; the law of mutual transformation of these functions 
can be stated without reference to that dependence. 





t Apart from an unimportant phase factor. 

t In mathematical terms, these functions are the irreducible representations of the rotation 
group. The number of functions which are transformed into linear combinations of one 
another is called the dimension of the representation; it is assumed that this number cannot 
be made smaller by taking any other linear combinations of these functions. 
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Let us consider a composite particle, such as an atomic nucleus, which is at 
rest as a whole and is in a definite internal state. In addition to an internal 
energy, it has also an angular momentum of definite magnitude L, due to the 
motion of the particles within the nucleus. This angular momentum can 
have 2L + 1 different orientations in space. Thus, in considering the move- 
ment of a complex particle as a whole, we must assign to it, as well as its 
coordinates, another discrete variable: the projection of its internal angular 
momentum on some chosen direction in space. 

However, with the preceding understanding of the concept of angular 
momentum, the origin of it becomes unimportant, and we naturally arrive at 
the concept of an “intrinsic” angular momentum which must be ascribed 
to the particle regardless of whether it is “composite” or ‘‘elementary”’ 

Thus, in quantum mechanics an elementary particle must be assigned a 
certain “‘intrinsic’’ angular momentum unconnected with its motion in space. 
This property of elementary particles 1s peculiar to quantum theory (it 
disappears ın the limit A —> 0), and therefore has in principle no classical 
interpretation.t 

The intrinsic angular momentum of a particle is called its spzn, as distinct 
from the angular momentum due to the motion of the particle in space, 
called the orbital angular momentum.] The particle concerned may be either 
elementary, or composite but behaving in some respect as an elementary 
particle (e.g. an atomic nucleus). The spin of a particle (measured, like the 
orbital angular momentum, in units of A) will be denoted by s. 

For particles having spin, the description of the state bv means of the wave 
function must determine the probability not only of its different positions in 
space but also of the possible orientations of the spin. Thus the wave function 
must depend not only on three continuous variables, the coordinates of the 
particle, but also on a discrete spin variable, which gives the value of the 
projection of the spin on a selected direction in space (the s-axis) and takes a 
limited number of discrete values, which we shall denote by o. 

Let ¥(x, y, z; o) be such a wave function. It is essentially a set of several 
different functions of the coordinates, corresponding to different values of o; 
these functions will be called the spin components of the wave function. The 
integral 


filan y, z; 0)? AV 


determines the probability that the particle has a certain value of o. The 
tqo E . J . » R 7 a » 
probability that the particle is in the volume element dJ’ with any value of o is 


dV È id(x, y, z; 0)/*. 





¢ In particular, it would be wholly meaningless to imagine the “‘intrinsic’’ angular 
momentum of an elementary particle as being the result of its rotation ‘‘about its own axis”. 

t The physical idea that an electron has an intrinsic angular momentum was put forward 
by G. Uhlenbeck and 5. Goudsmit in 1925. Spin was introduced into quantum mechanics 
in 1927 by W. Paull. 
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The quantum-mechanical spin operator, on being applied to the wave 
function, acts on the spin variable o. In other words, it in some way linearly 
transforms the components of the wave function into one another. The form 
of this operator will be established later. However, it is easy to see from very 
general considerations that the operators $+, Sy, Sz satisfy the same com- 
mutation conditions as the operators of the orbital angular momentum. 

The angular momentum operator is essentially the same as that of an 
infinitely small rotation. In deriving, in §26, the expression for the orbital 
angular momentum operator, we considered the result of applying the rotation 
operator to a function of the coordinates. In the case of the spin, this 
derivation becomes invalid, since the spin operator acts on the spin variable, 
and not on the coordinates. Hence, to obtain the required commutation 
relations, we must consider the operation of an infinitely small rotation in a 
general form, as a rotation of the system of coordinates. If we successively 
perform infinitely small rotations about the x-axis and the y-axis, and then 
about the same axes in the reverse order, it is easy to see by direct calculation 
that the difference between the results of these two operations is equivalent 
to an infinitely small rotation about the z-axis (through an angle equal to the 
product of the angles of rotation about the x and y-axes). We shall not pause 
here to carry out these simple calculations, as a result of which we again 
obtain the usual commutation relations between the operators of the com- 
ponents of angular momentum; these must therefore hold for the spin oper- 
ators also: 


5} =, CS = 8, eS) =i, (54.1) 


together with all the physical consequences resulting from them. 

The commutation relations (54.1) enable us to determine the possible 
values of the absolute magnitude and components of the spin. All the results 
derived in §27 (formulae (27.7)-(27.9)) were based only on the commutation 
relations, and hence are fully applicable here also; we need only replace L 
in these formulae by s. It follows from formula (27.7) that the eigen- 
values of the component of the spin form a sequence of numbers differing 
by unity. However, we cannot now assert that these values must be integral, 
as we could for the component L, of the orbital angular momentum (the 
derivation given at the beginning of §27 is invalid here, since it was based 
on the expression (26.14) for the operator /,, which holds only for the orbital 
angular momentum). 

Moreover, we find that the sequence of eigenvalues s, is limited above and 
below by values equal in absolute magnitude and opposite in sign, which we 
denote by +s. The difference 2s between the greatest and least values of s, 
must be an integer or zero. Consequently s can take the values 0, 4, 1, 3, .... 

Thus the eigenvalues of the square of the spin are 


s? = s(s+ 1), (54.2) 


where s can be either an integer (including zero) or half an integer. For given 
s, the component s, of the spin can take the values S, Se~il, es, =s, i.e. 2s+ 1 
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values in all. Accordingly, the wave function of a particle with spin s has 2s + 1 
components.T 

Experiment shows that the majority of the elementary particles (electrons, 
positrons, protons, neutrons, -mesons and all hyperons (A, =, =)) have a spin 
of 3. There are also elementary particles, the z-mesons and the K-mesons, 
whose spin is zero. 

The total angular momentum of a particle is composed of its orbital angular 
momentum l and its spin s. Their operators act on functions of different 
variables, and therefore, of course, commute. The eigenvalues of the total 
angular momentum 


j=l+s (54.3) 


are determined by the same “‘vector model” rule as the sum of the orbital 
angular momenta of two different particles (§31). That is, for given 
values of Z and s, the total angular momentum can take the values /+s, 
l+s—1,..., |/—s|. Thus, for an electron (spin $) with non-zero orbital angular 
momentum /, the total angular momentum can be f = /+4; for l= 0 the 
angular momentum 7 has, of course, only the one value? = 4. 

The operator of the total angular momentum J of a system of particles is 
equal to the sum of the operators of the angular momentum j of each particle, 
so that its values are again determined by the vector model rules. The angular 
momentum J can be put in the form 


J=L+S8, L=Z1, S=2s,, (54.4) 
a a 


where S may be called the total spin and L the total orbital angular momentum 
of the system. We notice that, if the total spin of the system is half-integral 
(or integral), the same is true of the total angular momentum, since the orbital 
angular momentum is always integral. In particular, if the system consists 
of an even number of similar particles, its total spin is always integral, and 
therefore so 1s the total angular momentum. 

The operators of the total angular momentum j of a particle (or J, of a 
system of particles) satisfy the same commutation rules as the operators of 
the orbital angular momentum or the spin, since these rules are general com- 
mutation rules holding for any angular momentum. The formulae (27.13) 
for the matrix elements of angular momentum, which follow from the com- 
mutation rules, are also valid for any angular momentum, provided that the 
matrix elements are defined with respect to the eigenstates of this angular 
momentum. Formulae (29.7)—(29.10) for the matrix elements of arbitrary 
vector quantities also remain valid (with appropriate change of notation). 


+ Since s is fixed for each kind of particle, the spin angular momentum ñs becomes zero 
in the limit of classical mechanics (A > 0). This consideration does not apply to the orbital 
angular momentum, since / can take any value. The transition to classical mechanics is 
represented by A tending to zero and / simultaneously tending to infinity, in such a way that 
the product Ål remains finite. 
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PROBLEM 


A particle with spin $ is in a state with a definite value sz = 4. Determine the probabilities 
of the possible values of the component of the spin along an axis 2’ at an angle @ to the z-axis. 

SOLUTION. ‘The mean spin vector § is evidently along the z-axis and has magnitude 4. 
Taking the component along the 2’-axis, we find that the mean value of the spin in that 
direction is sz% = $cos@. We also have sz = 4(ws—w-_), where ws are the probabilities 
of the values sz = +4. Since w++w- = 1, we find w, = cos*46, w- = sin?}8. 


§55. The spin operator 


In the rest of this chapter we shall not be interested in the dependence of 
the wave functions on the coordinates. For example, in speaking of the 
behaviour of the functions (x, y, z; o) when the system of coordinates is 
rotated, we can suppose that the particle is at the origin, so that its coordinates 
remain unchanged by such a rotation, and the results obtained will charac- 
terize the behaviour of the function w with regard to the spin variable o. 

The variable o differs frorn the ordinary variables (the coordinates) by 
being discrete. The most general form of a linear operator acting on functions 
of a discrete variable o is 


(fF Ho) = E fopa’), (55.1) 


where the fea» are constants. We put fẹ in parentheses in order to emphasize 
that the spin argument following it is not that of the original function y4 but 
that of the function resulting from it under the action of the operator f. It is 
easy to see that the quantities fse. are the same as the matrix elements of the 
operator, defined by the usual rule (11.5).+ 

The integration over the coordinates in (11.5) is here replaced by summa- 
tion over the discrete variable, so that the definition of-the matrix element is 


foo, = E pe (OLS pe (c)]. (55.2) 


Here pa (c) and Ya (c) are the eigenfunctions of the operator fz corresponding 
to the eigenvalues sz = o1 and a; each such function corresponds to a state 
in which the particle has a definite value of sz, i.e. in which only one com- 
ponent of the wave function is non-zero:f 


ac) = See, Yala) = Sea, (55.3) 





fT Note that the suffixes in the matrix elements on the right of (55.1) are written in an 
order which is, in a sense, the reverse of the usual order in (11.11). 
| More precisely, we should write 


Po, (a) E y(x, Y: xis a 


in (55.3) the coordinate factors are omitted, being unimportant in this connection. 


We must once again emphasize the distinction between the specified eigenvalue o, or a, of 
sz and the independent variable a. 
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According to (55.1), 


Nn 


(f fa (0) = © foopo (0) 
= © foa Osa, 
= lees 


and on substitution of this and Ye (0) the equation (55.2) is satished iden- 
tically; this completes the proof. 

Thus the operators acting on functions of o can be represented in the form 
of (2s + 1)-rowed matrices. In particular, we have for the operator of the spin 
itself, acting on the wave function, by (55.1), 


(8Y)(c) = FE Sco-f(o’). (55.4) 


According to what has been said at the end of §54, the matrices Sz, Sy, $z are 
identical with the matrices Lz, be Ê obtained in §27, where the letters L and 
M need only be replaced by s and o: 


(Sz)o.on1 = CA = $4/[(sto)(s—ct+])], 
(Sy)oo-1 =—-(Syoig = —H1y/[(s+0)(s—o+])}, (53:5) 
(Sz)oo = © 

This determines the spin operator. 


In the important case of a spin of $ (s = 4, o = +4), these matrices have 
two rows, and are of the form 


ê = }ô, (55.6) 


á 0 1 à 0 —: p l 0 z 
Or = (; a) Oy = 6 ) Cz = ’ 4) (55.7) 


These are called Pauli matrices. The matrix $; = }6,1s diagonal, as it should 
be, since it is defined in terms of the eigenfunctions of the quantity sz itself.] 

The following are some specific properties of the Pauli matrices. Direct 
multiplication of the matrices (55.7) gives the equations 


whereT 


Il 
q 
N 

ro 

l! 
— 


PR a 2 
BT l (55.8) 


yz = Tia O26; = 1y, xy — iz. 


+ In the tabular matrices (55.7) the rows and columns are numbered by the values of o, 
the row number corresponding to the first and the column number to the second suffix of 
the matrix element. In the present case, these numbers are +} and —}. The action of the 
operator shown by (55.4) multiplies row o of the matrix by a column matrix containing the 
components of the wave function: 

E ). 
v( ~ 4) 


t There should be ro misunderstanding because of the use of the same letter to denote the 
spin component and the Pauli matrices, since the latrer always have the circumflex. 
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Combining these with the general commutation rules (54.1), we find that 


ik + ki = 2ôik, (55.9) 


i.e. the Pauli matrices anticommute wıth one another. By means of these 
equations, we can easily verify the following useful formulae: 


&? = 3, (6.a)(6@.b) = a.b+:ĉ.axb, (55.10) 


where a and b are any vectors.t According to these relations, any scalar 
polynomial formed from the matrices 6; can be reduced to terms independent 
of ô and terms linear in 6; hence it follows that any scalar function of the 
operator ĝ reduces to a linear function (see Problem 1). Lastly, the values of 
the traces (sums of diagonal elements) of the Pauli matrices and their products 
are 

tro; = Ô, tr ojox, = 28;x. (55.11) 


Subsequent sections of this chapter give a more detailed account of the 
spin properties of wave functions, including their behaviour under any 
rotation of the coordinate system, but we may note immediately an important 
property of these functions, namely their behaviour in respect of rotations 
about the z-axis. 

Let there be an infinitesimal rotation through an angle 6¢ about the 
z-axis. The operator of such a rotation 1s expressed in terms of the angular 
momentum operator (in this case, the spin operator) as 1+70¢.8,. As a 
result of the rotation, the functions (co) therefore become ¢(c) + dy¥(c), 
where 


Wo) = 28¢ . Szo) = tox(o)d¢. 


Writing this relation in the form dy/dọ = zoy(c) and integrating, we find 
that a rotation through a finite angle ¢ changes the functions p(o) into 


Wa) = pojete. (55.12) 


In particular, a rotation through 27 multiplies them by a factor e?*te, which 
is the same for all o and is equal to ( — 1)®5 (since 2c always has the same 
parity as 2s). Thus, in a complete rotation of the coordinate system about the 
z-axis, the wave functions of particles with integral spin return to their 
original values, and those of particles with half-integral spin change sign. 


PROBLEMS 
PROBLEM 1. Reduce an arbitrary function of the scalar a+b.@ linear in the Pauli matrices 
to another linear function. 
SOLUTION. To determine the coefficients in the required formula f(a+b.2) = A+B.é6, 
we note that, when the z-axis is taken in the direction of b, the eigenvalues of the operator 


+ The terms on the right of (55.8)-(55.10) which are independent of @ must, of course, 
be understood as constants multiplying the unit two-by-two matrix. 
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a+b.é are a+b, and the corresponding eigenvalues of the operator f(a+b.@) are f(a+b). 
Hence we find 4 = 3[f(a+b)+ f(a—b)], B = (b/2b) [f(a+b)—f(a—d)]. 


PROBLEM 2. Determine the values of the scalar product sı . s2 of spins (4) of two particles 
in states in which the total spin of the system, S = sı +s2, has definite values (0 or 1). 


SOLUTION. From the general formula (31.3), which is valid for the addition of any two 
angular momenta, we find sı . s2 = 3 for S = 1, sı . S2 = —# for S = 0. 


PROBLEM 3. Which powers of the operator § of an arbitrary spin s are independent? 
SOLUTION. ‘The operator 


(§2—s)(§&—s+1)... (f2+5), 


formed from the differences between fz and all possible eigenvalues sz, gives zero when itis 
applied to any wave function, and is therefore itself zero. Hence it follows that (£2)2#t! is 
expressed in terms of lower powers of the operator z, so that only iis powers from 1 to 2s are 
independent. 


§56. Spinors 


When the spin is zero, the wave function has only one component, (0). 
The effect of the spin operator is to reduce it to zero: § = 0. The relation 
between § and the operator of an infinitesimal rotation implies that the wave 
function of a particle with zero spin is invariant under rotation of the co- 
ordinate system, i.e. it is a scalar. 

The wave function of a particle with spin 4 has two components, ¥(4) and 
¥( —4). For convenience in later generalizations, we shall distinguish these 
components by the superscripts 1 and 2 respectively. The two-component 


quantity 
ee 
is called a spinor. 


In any rotation of the coordinate system, the components of the spinor 
undergo a linear transformation: 


pl’ = apl +b, p = epl 4 dy. (56.2) 
This may be written 
w= (Oy, O=(% a) (56.3) 


where U is the transformation matrix.t Its elements are in general complex 
functions of the angles of rotation of the coordinate axes. They are con- 
nected by relations which follow directly from the physical conditions 
imposed on the spinor as the wave function of a particle. 

Let us consider the bilinear form 


pg- y2g1, (56.4) 


t The notation Uys implies that the rows of the matrix Ü are multiplied by the column yw. 
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where 4 and ¢ are two spinors. A simple calculation gives 
pr ge -yrl = (ad—be)(y1g2— Y2p1), 


1.€. (56.4) is transformed into itself when the coordinate system is rotated. 
If, however, there is only one function which is transformed into itself, it can 
be regarded as corresponding to zero spin, and therefore must be a scalar, 
1.€. must remain unchanged when the coordinate system is rotated in any 
manner. Hence we have 


ad—be = 1: (56.5) 


the determinant of the transformation matrix is unity.t 
Further relations follow from the requirement that the expression 


bh yl¥ 4 242%, (56.6) 


which determines the probability of finding the particle at a given point in 
space, should bea scalar. A transformation which leaves unchanged the sum 
of the squared moduli of the quantities is a unitary transformation, i.e. we 
must have Ü+ = U-1 (see §12). With the condition (56.5) the inverse 


matrix is 
oa 2 Pi 
-c ajf 


Equating this to the Hermitian conjugate matrix 


we find 


a=d*, b= —c*, (56.7) 


By virtue of the relations (56.5) and (56.7), the four complex quantities 
a, 6, c, d actually contain only three independent real parameters, corres- 
ponding to the three angles which define a rotation of a three-dimensional 
system of coordinates. 

Comparison of the expressions for the scalars (56.4) and (56.6) shows that 
¥'* and ¥2* must be transformed as Ww? and — yw respectively. It is easy to 
verify that this is in fact so, using (56.5) and (56.7). 

It is possible to put the algebra of spinors in a form analogous to that of 
tensor algebra. This is done by introducing, in addition to contravariant 





$ Such a transformation of two quantities is called a binary transformation. 

t This property is closely associated with symmetry under time reversal. The latter 
(see §18) involves the replacement of the wave function by its complex conjugate. Under 
tume reversal, the angular momentum components also change sign. Hence the functions that 


are the complex conjugates of the components ¢* = (4) and y? = y( ~— 4) must have proper- 


ties equivalent to those of the components corresponding to spin projections —4 and } 
respectively. 
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spinor components ¢1, p? (with superscripts), the covariant components (with 
subscripts), defined by 


di = P2, be = -yl (56.8) 


The invariant combination (56.4) of the two spinors may also be written as a 
scalar product 


pigi = Pldit p?p = pth? — pet; (56.9) 


here and below, summation over repeated (dummy) indices is implied, as in 
tensor algebra. We may note the following rule which has to be borne in 
mind in spinor algebra. We have 


Woa = Wldit+W?de = — ypg? — yig. 
phi = -pg (56.10) 


Thus 


Hence it is evident that the scalar product of any spinor with itself is zero: 
Wp, = 0. (56.11) 


According to the foregoing discussion, the quantities ¥, and qe are 
transformed as 1* and *, 1.€. 


Wa = (O*p);. (56.12) 


The product U*y; may also hz written as ¥U*, with the transposed matrix U*. 
Since U is unitary, we have U* = U-1, so that fi’, = (W0-1), ort 


p, = (p'0),. (56.13) 


Analogously to the transition from vectors to tensors in ordinary tensor 
algebra, we can introduce the idea of spinors of higher rank. Thus, a quantity 
Wà, having four components which are transformed as the products A¢# 
of the components of two spinors of rank one, is called a spinor of rank two. 
Besides the contravariant components pôr we can consider the covariant 
components p, and the mixed components y,” which are transformed as 
the products pġ, and pag” respectively. Spinors of any rank are similarly 
defined. 

The transition from contravariant to covariant spinor components and 
vice versa may be written 


pa = 21,40", P = grt da, (56.14) 


e--(°, 3) 56.15) 


+ The notation ¢U (with ¢ to the left of Ô) denotes that the components (4, Y2) as a row 
are multiplied by the columns of the matrix U. 


where 
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is the metric spinor in a vector space of two dimensions. Thus we have, for 
example, 


ye = Bap, day = Be ae 


so that pig = — 41! = — 2 day = Y1? = 22, and so on. 

The quantities g}, themselves form an antisymmetric unit spinor of rank 
two. It is easy to see that the values of its components remain unchanged 
under transformations of the coordinates, and that 


Zag" = AF, (56.16) 


where ôl = ô? = 1, ôl = 6,2 = 0. 

As in ordinary tensor algebra, there are two fundamental operations in 
spinor algebra: multiplication, and contraction with respect to a pair of in- 
dices. The multiplication of two spinors gives a Spinor of higher rank; thus, 
from two spinors of ranks two and three, Y), and ¥”"*, we can form a spinor 
of rank five, y,y"?*. Contraction with respect to a pair of indices (i.e. sum- 
mation of the components over corresponding values of one covariant and 
one contravariant index) decreases the rank of a spinor by two. Thus, a 
contraction of the spinor y,,”°? with respect to the indices u and v gives the 
spinor y),,?° of rank three; the contraction of the spinor yz) gives the scalar 
p>. Here there is a rule similar to that expressed by formula (56.10) :\if we 
interchange the upper and lower indices with respect to which the contraction 
is effected, the sign is changed (i.e. pà = —y4)). Hence, in particular, it 
follows that, if a spinor is symmetrical with respect to any two of its indices, 
the result of a contraction with respect to these indices is zero. Thus, for a 
symmetrical spinor pa, of rank two, we have 4 = 

A spinor of rank n symmetrical with respect to all its indices is called a 
symmetrical spinor of rank n. From an asymmetrical spinor we can construct a 
symmetrical one by the process of symmetrization, i.e. summation of the com- 
ponents obtained by all possible interchanges of the indices. From what has 
been said above, it is impossible to construct (by contraction) a spinor of lower 
rank from the components of a symmetrical spinor. 

Only a spinor of rank two can be antisymmetrical with respect to all its 
indices. For, since each index can take only two values, at least two out of 
three or more indices must have the same value, and therefore the compo- 
nents of the spinor are zero identically. Any antisyrmetrical spinor of rank 


two is a scalar multiple of the unit spinor g ru: We may notice here the fol- 
lowing relation: 


Bapto HEA tEn = 0 (56.17) 


(where ys) is any spinor), which follows from the above; this rule is simply a 
consequence of the fact that the expression on the left is (as we may easily 
verify) an antisymmetrical spinor of rank three. 

The spinor which is the product of a spinor y àp With itself, on contraction 
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with respect to one pair of indices, becomes antisymmetrical with respect to 
the other pair: 


On a a ey 


Hence, from what was said above, this spinor must be a scalar multiple of 
the spinor g),. Defining the scalar factor so that contraction with respect to 
the second pair of indices gives the correct result, we find 


Yr = Ep By" (56. 1 8) 


The components of the spinor ys), * which is the complex conjugate of 
Wau... are transformed as the components of a contravariant spinor $4#---, 
and conversely. The sum of the squared moduli of the components of any 
spinor is consequently invariant. 


§57. The wave functions of particles with arbitrary spin 


Having developed a formal algebra for spinors of any rank, we can now 
turn to our immediate problem, to study the properties of wave functions of 
particles with arbitrary spin. 

This subject 1s conveniently approached by considering an assembly of 
n particles with spin 3. ‘The greatest possible value of the z-component of the 
total spin is 47, which is obtained when s, = 4 for every particle (i.e. all the 
spins are directed the same way, along the z-axis). In this case we can 
evidently say that the total spin S of the system is also 4n. 

All the components of the wave function (c1, o2, ..., on) of the system 
of particies are then zero, except for (4, },..., 4). If we write the wave 
function as a product of n spinors d+... , each of which refers to one of the 
particles, only the component with A, y, ... = 1 in each spinor is not zero. 
` Thus only the product 441... is not zero. The set of all these products, 
however, is a spinor of rank n which is symmetrical with respect to all its 
indices. If we transform the coordinate system (so that the spins are not 
directed along the z-axis), we obtain a spinor of rank n, general in form except 
that it is symmetrical as before. 

The spin properties of wave functions, being essentially their properties 
with respect to rotations of the coordinate system, are identical for a particle 
with spin s and for a system of n = 2s particles each with spin 4 directed so 
that the total spin of the system iss. Hence we conclude that the wave function 
of a particle with spin s is a symmetrical spinor of rank n = 2s. 

It is easy to see that the number of independent components of a sym- 
metrical spinor of rank 2s is equal to 2s+1, as it should be. For all those 
components are the same whose indices include 2s ones and 0 twos; so are 
all those with 2s—1 ones and 1 two, and so on up to 0 ones and 2s twos. 

Mathematically, the symmetrical spinors provide a classification of the 
possible types of transformation of quantities when the coordinate system 


is rotated. If there are 2s+1 different quantities which are transformed 
linearly into one another (and which cannot be reduced in number by any 
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choice of linear combinations of them), then we can assert that their law of 
transformation is equivalent to that of the components of a symmetrical spinor 
of rank 2s. Any set of any number of functions which are transformed linearly 
into one another when the coordinate system is rotated can be reduced (by 
an appropriate linear transformation) to one or more symmetrical spinors.T 

Thus an arbitrary spinor apy... of rank n can be reduced to symmetrical 
spinors of ranks n, n—2,n—4, .... In practice, such a reduction can be made 
as follows. By symmetrizing the spinor zay... with respect to all its indices, 
we form a symmetrical spinor of the same rank n. Next, by contracting the 
original spinor y,,,.. with respect to various pairs of indices, we obtain 
spinors of rank n— 2, of the form #4,,..., which, in turn, we Symmeéetrize, so 
that symmetrical spinors of rank »—2 are obtained. By symmetrizing the 
spinors obtained by contracting with respect to two pairs of indices, 
we obtain symmetrical spinors of rank n —4, and so on. 

We have still to establish the relation between the components of a sym- 
metrical spinor of rank 2s and the 2s +1 functions (0), where o = s,s—1,..., 
—s. ‘The component 


11.-.] 22...2 








$+0 $-o s 


in whose indices 1 occurs s +o times and 2 s—o times, corresponds to a value 
o of the projection of the spin on the z-axis. For, if we again consider a system 
of n = 2s particles with spin $, instead of one particle with spin s, the product 
Wgl... y2p?... corresponds to the above component; this product belongs to a 


sta s—a 


state in which s+o particles have a projection of the spin equal to 4, and 
s—o a projection of — 4, so that the total projection is $(s-+o)—4(s—oc) = o. 
Finally, the proportionality coefficient between the above component of 
the spinor and (c) is chosen so that the equation 








2 [Y(o)|? = weal (57.1) 


holds; this sum is a scalar, as it should be, since it determines the probability 
of finding the particle at a given point in space. Inthe sum on the right-hand 
side, the components with (s+) indices 1 occur 


(2s)! 
(s+ c)!(s—a)! 


times. Hence it is clear that the relation between the functions ¥(o) and the 
components of the spinor is given i the formula 


t In other words, the symmetrical spinors form what are called irreducible representations 
of the rotation group (see §98). 


212 Spin §57 


The relation (57.2) ensures the fulfilment not only of the condition (57.1), 
but also, as we easily see, of the more general condition 


pehy, = L(—1)Y(a)$(—0), (57.3) 


where pòr. and ¢ àu.. are two different spinors of the same rank, while 
¥(c),¢(o) are functions derived from these spinors by formula (57.2); the 
factor (—1)*~° is due to the fact that, when all the indices of the spinor com- 
ponents are raised, the sign changes as many times as there are twos among 
the indices. 

Formulae (55.5) determine the result of the action of the spin operator 
on the wave functions ẹ(o). It is not difficult to find how these operators act 
on a wave function written in the form of a spinor of rank 2s. For a spin 4, 
the functions ($), Y(— $) are the same as the components y1, y? of the spinor. 
According to (55.6) and (55.7), the result of the spin operators’ acting on them 
will be 


(E) = dy, (Sy)! 
(Sau) 2 = dy" (Sy)? 


— ty’, (Sy = a, 


57.4 
it, (Gy)? = By ems) 


To pass to the general case of arbitrary spin, we again consider a system 
of 2s particles with spin $, and write its wave function as a product of 2s 
spinors. The spin operator of the system is the sum of the spin operators 
of each particle, acting only on the corresponding spinor, the result of this 
action being given by formulae (57.4). Next, returning to arbitrary symmetri- 
cal spinors, i.e. to the wave functions of a particle with spin s, we obtain 


ll pee 22 one 11 een 22 one 
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(Ei 8—0 $1(s-+ of Se 2i(s oS &—o—1’ (57.5) 
E ene eae Ilon ee 
Cl eas = opas =a 


Hitherto we have spoken of spinors as wave functions of the intrinsic angular 
momentum of elementary particles. Formally, however, there is no difference 
between the spin of a single particle and the total angular momentum of any 
system regarded as a whole, neglecting its internal structure. It is therefore 
evident that the transformation properties of spinors apply equally to the 
behaviour, with respect to rotations in space, of the wave functions jm of any 
particle or system of particles with total angular momentum j, independent of 
whether orbital or spin angular momentum is concerned. ‘There must therefore 
be some definite relation between the laws of transformation for the eigen- 
functions yj, under rotations of the coordinate system and those for the 
components of a symmetrical spinor of rank 2). 
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In establishing this relation we must, however, make a clear distinction 
between two aspects of the dependence of the wave functions on the component 
m (for a given value ofj). The wave function may be regarded as the probability 
amplitude for various values of m, or may be considered for a given value of m. 

These two aspects have already been discussed at the beginning of §55, 
where we dealt with the eigenfunction 6,,, of the operator §, which corres - 
ponds to sz = oo. The mathematical difference between them is especially 
clear for a particle of spin s = 4. In this case the spin function is, with respect 
to the variable c, a contravariant spinor of rank 1, 1.e. must be written in spinor 
notation as 6”, . With respect to oo it is therefore a covariant spinor. 

This is evidently a general result: the eigenfunctions ym can be put in 
correspondence with the components of a covariant symmetrical spinor of 
rank 27 by means of formulae analogous to (57.2) :T 


pim = pe O 22 (57.6) 
MO N GE- E 


The eigenfunctions of integral angular momentum j are spherical har- 
monics Yj. The casey = lis of particular importance. The three spherical 


harmonics Yim are 
. | . [3 
Yin =i |= 0080 =i [=m 
47 4or 
noel Sie : soaks 
Yia = F: lz sinô et? = F i | reskin), 
f Sar Sar 


where n is a unit vector along the radius vector. It is seen that these three 
functions are equivalent, as regards their transformation properties, to the 
components of a vector a, with the relations 


Jio = taan u=- et iay) yı -1 = geia) (57.7) 


Comparing with (57.6), we see that the components of a symmetrical spinor 
of rank two can be brought into correspondence with the components of the 


at This result can also be regarded somewhat differently. If the wave function ¢ of a particle 
in a state with angular momentum j is expanded in terms of the eignfunctions pjm: 


y% = Lam wim, 


then the coefficients am are the probability amplitudes for various values of m. In this sense 
they correspond to the “components” (mm) of a spin wave function, and this gives their law 
of transformation. On the other hand, the value of y at a given point in space cannot depend 
on the choice of the coordinate system, i.e. the sum È amyjm must be a scalar. Comparing 
with the scalar (57.3), we see that am must transform as (—1)/-" pj. m. 
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vector by the formulae 


i 7 l T , 
pi = P Yu = — te t tau) Yoo = ye (57.8) 


yl2— — Se pll = n, p22 = — etia) (57.9) 


2 
Conversely 
i l 
az = 14/212, Az = ——(r22-— yf), = ——(yll4 22), (57.10 
Vey WA Yh ay 7” g2). ( ) 
It is easily verified that with these definitions we have 
br uh? s a.b, (57.1 1) 


where a and b are vectors corresponding to the symmetrical spinors ô+ and 
$n. It is also not difficult to see that there is a correspondence between the 
spinor and the vectort 


pip” + ypt and 4/2a xb. (57.12) 
Formulae (57.10) may be compactly written by means of the Pauli matrices: 


a = ott, y= a. ohh; (57.13) 


V2 v2 


the matrix indices of ĝ are written as superscript and subscript in corres- 
pondence with the position of the spinor indices in %#. The origin of this 
formula is easily understood by considering the particular case where the 
spinor of rank two #4 reduces to a product of a spinor of rank one y+ and its 
complex conjugate %4*, Then the quantity 444*64,.# is the mean value of the 
spin (for a particle with wave function +) and it is therefore evidently a 
vector. 

The relations (57.8) or (57.9) are a particular case of a general rule: any 
symmetrical spinor of even rank 2j, where j is integral, can be correlated witha 
symmetrical tensor of half the rank (j) which gives zero on contraction with 
respect to any pair of indices; we call this an zrreducible tensor. 

This follows from the fact that the numbers of independent components 
of the spinor and of the tensor are the same (2j + 1), as may easily be seen.] 
The relation between the components of the spinor and of the tensor can be 
found by means of formulae (57.8)-(57.10), if we consider a spinor of the rank 
concerned as the product of several spinors of rank two, and the tensor as a 
product of vectors. 


x - Fi =. A À = 
t The mixed components of a symmetrical spinor may be written in the form woe without 
distinction between y^, and y^. 
fe 254. 1 j ible t f rank j integer) 
t We can say that the 27+1 components of an irreducible tensor of ra 7 (an integer), 
the 2}+1 spherical harmonics Yjm, and the 27+1 components of a symmetrical spinor of 
rank 2j give the same irreducible representation of the rotation group. 
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PROBLEMS 
PROBLEM 1. Rewrite the definition (57.4) of the operator of spin $ in terms of the spinor 
components of the vector 5. 


SoLuTion. By means of formulae (57.9), which give the relation between the vector S 
and the spinor $44, the definition (57.4) can be written as 


Say = Orp) 


PROBLEM 2. Derive formulae which determine the effect of the spin operator on a vector 
wave function of a particle with spin 1. 

SOLUTION. The relation between the components of the vector function Ww and the com- 
ponents of the spinor ~*” is given by formulae (57.9), and from (57.5) we have 


Fes = yh, Sap- = ý, Sapz =0 
(where p+ = Prttyy) or 
fyz = — ipy, Say = nbz, Saf, = Q. 


The remaining formulae are derived from these by cyclic permutation of the suffixes x, y, z. 
They can be written together as 


Sipe = — teint 


The complex vector p can be put in the form W = e™(u+iv), where u and v are real 
vectors, which can be taken to be mutually perpendicular if the common phase © is suitably 
chosen. The two vectors u and v determine a plane which has the property that the spin 
component perpendicular to it can take only the values +1. 


§58. The operator of finite rotations 


Let us now return to the transformation of spinors, and show how the 
coefficients of this transformation can in fact be expressed in terms of the 
angles of rotation of the coordinate axes. 

By the definition of the angular momentum operator (in this case, the spin 
operator), 1+78¢.n.8 is the operator of a rotation through an angle 6¢ 
about a direction specified by the unit vector n; for application to the wave 
function of a particle with spin 4, i.e. a spinor of rank one, we must take 
§ = 4¢ in this operator. The operator of a rotation through a finite angle ¢ 
about the same direction will be correspondingly given by 


U,, = exp(didn . ô); (58.1) 


cf. (15.13). Like any function of the Pauli matrices (see §55, Problem 1), this 
expression reduces to one that is linear in these matrices: 


U,, = cos łġ+in. ê sin $ġ. (58.2) 
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For example, with a rotation about the z-axis, 


Ü) 


cos tọ +iôz sin ? 
_ fer 0 $ 


This means that the components of the spinor are transformed in such a 
rotation according to 


tole 
S 


fl? = let $/2, p = pret p2, 


In particular, in a rotation through-an angle 2r the spinor components change 
sign; spinors of any odd rank must therefore have the same property (cf. the 
end of §55). i 

Similarly, we can find the matrices of transformations consisting of a 
rotation through an angle ¢ about the x-axis or the y-axis: 


044) ye 4@ isin a 


i sin 4 cos łġ 
(58.4) 
Gay (0 +¢ sin a 


—sin łġ cos ł¢ġ 


We may note the particular case of a rotation through an angle z about the 
y-axis, for which 


Yl =y, y? = h, 
fl = dy, Y” = po: (58.5) 


l.e. 


It is now easy to write down the transformation matrix for any rotation of 
the coordinate axes, as a function of the Eulerian angles which specify the 
rotation. 

A rotation of the axes, defined by the Eulerian angles a, £, y, is carried out 
in three stages: (1) a rotation through the angle « (0 < « < 27) about the 
z-axis, (2) a rotation through the angle £ (0 < B < 7) about the new position 
of the y-axis (ON in Fig. 20, called the line of nodes), (3) a rotation through 
the angle y (0 < y < 27) about the resulting final position 2’ of the z-axis. 

It is evident that the angles a and £ are the spherical polar angles ¢ and @ 
of the new 2’-axis with respect to the xyz axes: a = ¢, 8 = 0. 

In accordance with this manner of rotating the axes, the matrix of the 


+ The systems xyz and x’y’’2’ are, as always, right-handed, and a positive angle corresponds 
to the movement of a corkscrew advanced in the positive direction of the axis of rotation. 

The definition of the Eulerian angles given here (and usual in quantum mechanics) differs 
from that in Mechanics, §35, in that the second rotation 1s about the y-axis and not the X-axis. 
The angles a, $, y are related to the angles ¢, 8, # used in Mechanics (net the spherical polar 
angles ¢ and 8) by ¢ = a+r. G= 8, d= y— tn. 
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Fic. 20 


complete transformation is equal to t!.e product of three matrices (58.3) and 
(58.4): g ; À i 
U(a, P, y) = Udy)UAB)U2(2). 


By direct multiplication of the matrices we finally obtain 


Ua, P, y) = ( (58.6) 


cos 48 . et +y)/2 sin ip. e —tlz—y)/2 
— sin 46 ; et (z —y)/2 Cos 48 : eis +y)/2 È 


Spinors of higher ranks are, by definition, transformed as products of 
components of a spinor of rank one. In physical applications, however, we 
are interested in the wave functions ej; rather than the transformation laws 
of the spinors themselves. 

Let the functions pjm (m = j,j—1, -.., —j) describe, in a coordinate system 
xyz, a state having a definite value of the angular momentum j, and jm: the 
same state for the axes xyz’; in the first case m is the value of jz, and in the 
second case m’ is the value of jz. The two sets of functions are connected by 
linear relations, which we write in the form 


Wim = 2, DO arma, B, y) Pim’. (58.7) 


The coefficients DA) m:m form a matrix of order 2j; + 1 with respect to m’ and 
m, called the finite-rotation matrix DO); its elements are functions of the 
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angles «, $, y of rotation of the system x’y'z’ relative to xys. 

The finite-rotation matrix can be built up by means of the spinor represen- 
tation of the functions jm. Forj = 4, the two functions gi,n(m = + 3) form 
a covariant spinor of rank 1. According to (56.13), its transformation from 
x'y'z’ to xyz is effected by the matrix U (58.6), so that DQ/2) = Ut Its 
elements may be written 


where 
m = me” 3 -4 
da8) = 3 cos 48 sin 4 (58.8) 
—} —sin $8 cos 38 


For any value of j, the functions pjm are related to the components of a 
symmetrical covariant spinor of rank 27 by (57.6). The transformation 
matrix for the components of a spinor of rank 2j is the product of 2) matrices 
D2), each acting on one of the spinor indices. Carrying out the multipli- 
cation and returning to the functions Ọm, we find their transformation 
matrix: 


De: B, y) = emy di? (Beis, (58.9) 


the functions d}? (8) being given byf 


m 


e a a (008 BEI tm x 


x (sin 8ye =m pn mm +m cos p), (58.10) 


where 


P,» (cos B) = ka — cos B)-%(1+cos B)-8 x 


x ( i PI -cos pjeta(1+cos B)o+™] (58.11) 


d cos £ 


+ Note that the matrix indices in (58.7) are placed in the order that corresponds to multi- 
plying the columns of the matrix D\/) by the functions yym' arranged in a row. In the symbolic 
notation, (58.7) would have to be written Yym = (y; D”)m in accordance with (56.13). 

t The calculations are described by A. R. Edmonds, Angular Momentum in Quantum 
Mechanics, Princeton, 1957. The definition of the functions Dm'm”) by (58.9) differs from 
that used in Edmonds’s book by the interchange of « and y, this being the more natural 
treatment in the approach given here. 
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are called Jacobi polynomials.t We may note that 
Pa ( — cos £) = (— 1)” Pa‘®: (cos £). (58.12) 


The functions dẸ?„, possess a number of symmetry properties which 
might be derived from the expressions (58.11) and (58.12), but it is simpler 
to obtain them directly from the definition as coefficients in the rotational 
transformation. 

The matrix D is unitary, being the matrix of a rotational transformation. 
Since the transformation inverse to the rotation («, $, y) is the rotation 
(—y, —B, —«), we have for the real matrix d the relations 


a? (-B) = a _(B)- (58.13) 
The following equations are also valid: 
DD mP) = oO AP): (58.14) 
a2), (2) = (—1)9*™3 pm, 
dD (—7) = (-1)3- "8m, m, (58.15) 


2 (0) = A 


When j = } these are evident from (58.8); the generalization to arbitrary j is 
evident from the manner of construction of the transformation matrix, 
described above. 

A rotation through an angle 7—f can be carried out as two successive 
rotations through m and — £: 


BD (7 B) = Ed E nl — B) 
= (e e 2), 
or, using (58.13), 
D (a—f) = (1) aD m (B). (58.16) 


The result of two rotations about the same axis is independent of the 
sequence in which they occur. We must therefore arrive at the same result 
by carrying out the rotations through —f and ~ in the opposite order. 
Comparison of the result with (58.16) gives the relation 


O = (= Vy ea, al Den) 
From (58.17), (58.14) and (58.13), it follows that 
d AB) = (ym md, (B) = (—1)m'-ma,(—B). (58.18) 


t See Se of the Mathematical Appendices, formula (e.11), for the relation between these 
polynomials and the hypergeometric series. 


220 Spin §58 
Using (58.13)-(58.18), we can deduce various symmetry properties of the 


complete matrix elements DY. In particular, the complex conjugate 
function is given by 


Dine (as B,y) = sD a, P, =) 
6 dg 1)m SEROR ~ mn (&% B, y) (58.19) 


Mathematically, the matrices ÂO give the unitary irreducible represen- 
tations of the rotation group having dimension 2; + 1 (see §98 below). Hence 
we have immediately the orthonormality relation 


1 


| DR (e Bs 9) DA (BNE = Bis Smm b's (58.20 


where dw = sin £ da dB dy. 

The orthogonality of the functions with respect to the suffixes m and m’ is 
ensured by the factor ef™+m»; that with respect to the index j arises from 
the functions a? l» for Which we have 


n 


| ad (B)d,72,(B) - 4 sin £ dB = ES 


: (58.21) 


0 


Lastly, we shall give for reference the expressions for the functions d), 
for various particular values of the parameters. Forj = 1, we have 


m= 1 0 =l 
m = 
1 (1+ cos £) = sin B 4(1—cos £) 
dinm(P) = 0 — 5 sin £ cos f = sin £ (58.22) 
—1 4(1 — cos £) T sin A 4(1+cos £) 


For integral j = land m = 0, formulae (58.10) and (58.11) give 


d® (p) = (—1)"4,(B) = (- 1)” aon — va Pym(cos 8). (58.23) 


mo (l+m 


The derivation of this formula is easily seen from the original definition (58.7). 
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We shall assign the values of the functions jm: on the right of (58.7) to the 
z-axis, on which (forj = l) 


1 
Yin(ne) = è |E bn (58.24) 


The function jm, on the left is then the spherical harmonic function Yin(, «) 
of the spherical polar angles 6 = a, 6 = £ giving the direction of the 2’-axis. 
Substitution of (58.24) in (58.7) leads to 


Vina) 2a reine By), (58.25) 


which is equivalent to (58.23). 
Lastly, there is the following expression for the function with the maximum 
possible value of m or m’: 


i eagle 
JOB) = (—1)-"d 0 


7 bonm cosi+™ 48 sinf-m 3B. (58.26) 


§59. Partial polarization of particles 


By a suitable choice of the direction of the z-axis, we can always cause one 
component (e.g. 2) of a given spinor 4^, the wave function of a particle with 
spin 4, to vanish. This is evident from the fact that a direction in space 1s 
determined by two quantities (angles), i.e. the number of disposable parameters 
is just equal to the number of quantities (the real and imaginary parts of the 
complex 242) which it is desired to make zero. 

Physically this means that, if a particle with spin 4 (for definiteness, we shall 
speak of an electron) is in a state described by a spin wave function, then there 
is a direction in space in which the component of the particle spin has the 
definite value o = }. We can say that in such a state the electron is completely 
polarized. 

There are also, however, states of an electron which may be said to be 
partially polarized. Such states are not described by wave functions but only 
by density matrices, i.e. they are mixed states (with respect to spin) (see §14). 

The spin or polarization density matrix of an electron is a spinor pèr of rank 
two normalized by the condition 

P*a = phi + p%2= 1, (5981) 
and satisfying the “Hermitian” condition 
(p*,.)* = phi (59.2) 
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For a pure (i.e. completely polarized) spin state of the electron the spinor 
p*, reduces to a product of components of the wave function y+: 


pen = p y. (59.3) 


The diagonal components of the density matrix determine the probabilities 
of the values + 4 and — 4 of the z-component of the electron spin. The mean 
value of this component is therefore 


Sz = 2(p'1~ p*2); 
or, using (59.1), 
pr = itin p= hsa (59.4) 


In a pure state the mean value of the quantities s} = sz+isy is calculated as 
= Ang pa 
$4 = yp *S uh ’ 
i = yey 


Since, according to (55.6) and (55.7), the operators f4 are given by the 
matrices 


we find that 
= lye, = fp. 
Accordingly we have in a mixed state 


pa = Ss, pt = Ss: (59.5) 
Using the Pauli matrices, formulae (59.4) and (59.5) can be combined as 
p*, = 4(8*,4 26", . 5). (59.6) 


Thus all the components of the polarization density matrix of the electron 
are expressed in terms of the mean values of components of its spin vector. In 
other words, the real vector § entirely determines the polarization properties of 
a particle with spin 4. In the limit of complete polarization one of the com- 
ponents of this vector (with an appropriate choice of the directions of the 
axes) is 4 and the other two are zero. In the opposite case of an unpolarized 
state all three components are zero. In the general case of an arbitrary partial 
polarization and any choice of the coordinate system we have 0 < p < 1, 
where 
p = (z tat 57)? 


is a quantity which may be called the degree of polarization of the electron. 
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For a particle of arbitrary spin $, the density matrix is a spinor pê” 
of rank 4s, symmetrical in the first 2s and the last 2s indices and i e the 
conditions 


prey. = 1, (59.7) 
Pa E ® (59.8) 


Il 
mo) 

a=) 

a 
= 
E 


To calculate the number of independent components of the density matrix, 
we note that, among the possible sets of values of the indices À, p, ... (Or p, a, ...) 
there are only 2s+1 which are essentially different. Using also the fact that 
the components of the spinor p4#---,,.., are related by (59.7), we find that the 
number of different components is (2s+1)?—1 = 4s(s4+1). Although these 
components are complex, the relation (59.8) shows that this does not increase 
the total number of independent quantities describing the state of partial 
polarization of the particle, which is therefore 4s(s+ 1).f For comparison, it 
may be remarked that the state of complete polarization of the particle is de- 
scribed by only 4s quantities (the 2s+ 1 complex components of the wave function 
àn: related by one normalization condition and containing one common 
phase which is unimportant in the description of the state). 

Like any spinor of rank 4s, the spinor p2#--p,,.. is equivalent to a set of 
irreducible tensors of ranks 4s, 4s—2, ..., 0. In the present case there is only 
one tensor of each rank, since, on account of the symmetry properties of the 
spinor p4#---,,..., each contraction of it can be carried out in only one way: 
with respect to any one of the indices A, p, ... , and ene of p, ©, .... In addition, 
the scalar (tensor of rank 0) does not appear, reducing to unity by virtue of the 
condition (59.7). 


$60. Time reversal and Kramers’ theorem 


The symmetry of motion with respect to a change in the sign of the time is 
expressed in quantum mechanics by the fact that, if ẹ is the wave function of 
a stationary state of the system, the “time-reversed’’ wave function (which we 
denote by tev) describes a possible state with the same energy. At the end of 
§18 it has been pointed out that TeV is the same as the complex conjugate 
function %*. In this simple form the statement applies to wave functions where 
the spin of particles is neglected. When spin is present, a refinement is necessary. 

Let us take the wave function of a particle of spin s in the form of the contra- 
variant spinor ¥“--- (of rank 2s). On taking the complex conjugate function 
y.-.* we obtain a set of quantities which are transformed as components of a 
covariant spinor. Hence the operation of time reversal corresponds to a change 
from the wave function %’--- to a new wave function whose covariant com- 
ponents are given by 


rev 


Papa = JAk, (60.1) 


t When these quantities are given, so are the mean values of the components of the vector s 


and all their powers and products 2, 3, .., 2s at a time, which do not reduce to lower powers 
(see $55, Problem 3). 
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For a given set of values of the indices A, p, ... , the components of covariant 
and contravariant spinors correspond to values of the angular-momentum 
component which differ in sign. In terms of the functions pse, therefore, time 
reversal corresponds to a change from ss, to ws,_,, as it should, since a change 
in the sign of the time changes the direction of the angular momentum. The 
exact relation is given by (60.1): 


bso = ps*(— 1}. (60.2) 


Thus the change se > Ysa” required by the operation of time reversal 
signifies the changet 


rev 


ev = Ys*(— 1) -2. (60.3) 


When this operation is repeated, we have 


bee > Ys,-a PR > Pool — 1}8-9( — A pso( — 1). 


Thus a twofold time reversal restores the wave function to its original value 
only if the spin is integral; if the spin is half-integral, the sign of the wave 
function is changed. 

Let us consider an arbitrary system of interacting particles. The orbital and 
spin angular momenta of such a system are not in general separately conserved 
when relativistic interactions are taken into account. Only the total angular 
momentum J is conserved. If there is no external field, each energy level of the 
system has (2/+1)-fold degeneracy. When an external field is applied, the 
degeneracy is removed. The question arises whether the degeneracy can be 
removed completely, i.e. so that the system has only simple levels. This 1s 
closely related to the symmetry with respect to time reversal. 

In classical electrodynamics the equations are invariant with respect to a 
change in the sign of the time, if the electric field is left unchanged and the sign 
of the magnetic field is reversed.{ This fundamental property of motion must 
be preserved in quantum mechanics. Hence, not only in a closed system but in 
any external electric field (there being no magnetic field), there is symmetry 
with respect to time reversal. 

The wave functions of the svstem are spinors yAr--, whose rank 7 is twice 
the sum of the spins sq of all the particles (n = 2 È sq); this sum may not be 
equal to the total spin S of the system. 

According to what was said above, we can assert that, in any electric field, 
the wave function and its time reversal must correspond to states with the same 
energy. If a level is non-degenerate, it is necessary that these states should be 
identical, i.e. the corresponding wave functions must be the same apart from a 


+ Note that the rule for the complex conjugate of a spherical harmonic function, according 
to (28.9), coincides with the general rule (60.3). 
t See, for example, Fields, §17, and the end of §111 below. 
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constant factor (both, of course, being expressed as similar (covariant or contra- 
variant) spinors). 


We write Pie. = E Or, by (60.1), 


pre F Ciia (60.4) ' 


where C is a constant. 
Taking the complex conjugate of both sides of this equation, we obtain 


p = CMa *. 


We lower the indices on the left-hand side of the equation and correspond- 
ingly raise them on the right. This means that we multiply both sides of the 
equation by gq, --- and sum over the indices À, x, ... ; on the right-hand side 
we must use the fact that 


pozei = (1gp. 


As a result we have 
bu = Cp, 


Substituting y4#---* from (60.4), we find 
Pru... =< (DCC Yig.. 


This equation must be satisfied identically, i.e. we must have (—1)"CC* = 1. 
Since, however, |C|? is always positive, it is clear that this is possible only 
for even 7 (i.e. for integral values of the sum È Sq). For odd n (half-integral 
values of È sa) the condition (60.4) cannot be fulfilled. 

Thus we reach the result that an electric field can completely remove the 
degeneracy only for a system with an integral value of the sum of the spins of 
the particles. For a system with a half-integral value of this sum, in an 
arbitrary electric field, all the levels must be doubly degenerate, and complex 
conjugate spinors correspond to two different states with the same energy] 
(H. A. Kramers 1930). 

One further, mathematical, comment may be made. A relation of the form 
(60.4) with a real constant C 1s mathematically the condition that the com- 
ponents of the spinor may be put in correspondence with a set of real 
quantities, and may be called the condition for the spinor to be “real’’.|| The 
impossibility of fulfilling the condition (60.4) for odd z signifies that no real 
quantity can correspond to a spinor of odd rank. For even n, on the other 
hand, the condition (60.4) can be satisfied, and C can be real. In particular, a 


t When the sum È sq 1s integral (or half-integral), all possible values of the total spin S of 
the system are also integral (or half-integral). 

t If the electric field possesses a high (cubic) symmetry, fourfold degeneracy may occur 
(see §99, including the Problem). 


It is meaningless to call the spinor real in the literal sense, since complex conjugate spinors 
have different laws of transformation. 
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real vector can correspond to asymmetrical spinor of rank two if the condition 
(60.4) is satisfied with C = 1: 


pave = Pry 


(as is easily seen by means of (57.8) and (57.9)). The condition (60.4) with 
C = 1 is in fact the condition for a symmetrical spinor of any even rank to be 
“real”. 


CHAPTER IX 


IDENTITY OF PARTICUES 


§61. The principle of indistinguishability of similar particles 


In classical mechanics, identical particles (electrons, say) do not lose their 
‘“andividuality’, despite the identity of their physical properties. For we 
can imagine the particles at some instant to be “numbered”, and follow the 
subsequent motion of each of these in its path; then at any instant the particles 
can be identified. 

In quantum mechanics the situation is entirely different. We have 
already mentioned several times that, by virtue of the uncertainty principle, 
the concept of the path of an electron ceases to have any meaning. If the 
position of an electron is exactly known at a given instant, its coordinates have 
no definite values even at the next instant. Hence, by localizing and num- 
bering the electrons at some instant, we make no progress towards identifying 
them at subsequent instants; if we localize one of the electrons, at some other 
instant, at Some point in space, we cannot say which of the electrons has 
arrived at this point. 

Thus, in quantum mechanics, there is in principle no possibility of separ- 
ately following each of a number of similar particles and thereby distinguish- 
ing them. We may say that, in quantum mechanics, identical particles 
entirely lose their “individuality”. The identity of the particles with respect 
to their physical properties is here very far-reaching: it results in the complete 
indistinguishability of the particles. 

This principle of the zndistinguishability of similar particles, as it is called, 
plays a fundamental part in the quantum theory of systems composed of 
identical particles. Let us start by considering a system of only two particles. 
Because of the identity of the particles, the states of the system obtained 
from each other by merely interchanging the two particles must be com- 
pletely equivalent physically. This means that, as a result of this inter- 
change, the wave function of the system can change only by an unimportant 
phase factor. Let (£, €) be the wave function of the system, & and £é con- 
ventionally denoting the three coordinates and the spin projection for each 
particle. Then we must have 


WE, Es) = eu Eo, £1); 


where æ is some real constant. By repeating the interchange, we return to 


the original state, while the function y is multiplied by e#*. Hence it follows 
that 22> = lone? = +1. Thus 


WE), Eo) = + léz £1). 
227 
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We thus reach the result that there are only two possibilities: the wave 
function is either symmetrical (i.e. it is unchanged when the particles are inter- 
changed) or antisymmetrical (i.e. it changes sign when this interchange is 
made). Itis obvious that the wave functions of all the states of a given system 
must have the same symmetry; otherwise, the wave function of a state which 
was a Superposition of states of different symmetry would be neither sym- 
metrical nor antisymmetrical. 

This result can be immediately generalized to systems consisting of any 
number of identical particles. For it is clear from the identity of the particles 
that, if any pair of them has the property of being described by, say, sym- 
metrical wave functions, any other pair of such particles has the same pro- 
perty. Hence the wave function of identical particles must either be un- 
changed when any pair of particles are interchanged (and hence when the 
particles are permuted in any manner), or change sign when any pair are 
interchanged. In the first case we speak of a symmetrical wave function, and in 
the second case of an antisymmetrical one. 

The property of being described by symmetrical or antisymmetrical wave 
functions depends on the nature of the particles. Particles described by 
antisymmetrical functions are said to obey Fermi-Dirac statistics (or to be 
fermions), while those which are described by symmetrical functions are 
said to obey Bose-Einstein statistics (or to be bosons). 

From the laws of relativistic quantum mechanics it can be shown (see 
RQT, §25) that the statistics obeyed by particles is uniquely related to their 
spin: particles with half-integral spin are fermions, and those with integral 
spin are bosons. 

The statistics of compiex particles is determined by the parity of ine 
number of elementary fermions entering into their composition. For an 
interchange of two identical complex particles is equivalent to the simul- 
taneous interchange of several pairs of identical elementary particles. The 
interchange of bosons does not change the wave function, while the inter- 
change of fermions changes its sign. Hence complex particles containing 
an odd number of elementary fermions obey Fermi statistics, while those 
containing an even number obey Bose statistics. This result is, of course, 
in agreement with the above rule, since a complex particle has an integral 
or a half-integral spin according as the number of particles with half-integra! 
spin entering into its composition is even or odd. 

Thus atomic nuclei of odd atomic weight (i.e. containing an odd number of 
neutrons and protons) obey Fermi statistics, and those of even atomic weight 
obey Bose statistics. For atoms, which contain both nuclei and electrons, the 
statistics is evidently determined by the parity of the sum of the atomic 
weight and the atomic number. 


+ This terminology refers to the statistics which describes a perfect gas composed of 
particles with antisymmetrical and symmetrical wave functions respectively. In actual fact 
we are concerned here not only with a different statistics, but essentially with a different 
mechanics. Fermi statistics was proposed by E. Fermi for electrons in 1926, and its relation 
to quantum mechanics was elucidated by P. A. M. Dirac (q 926). Bose statistics was proposed 
byv'S. N. Bose for light quanta, and generalized by A. Einstein (1924). 
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Let us consider a system composed of N identical particles, whose mutual 
interaction can be neglected. Let y, We, ... be the wave functions of the vari- 
ous stationary states which each of the particles separately may occupy. 
The state of the system as a whole can be defined by giving the numbers of 
the states which the individual particles occupy. The question arises how 
the wave function ẹ of the whole system should be constructed from the 
functions gy, yo, ... - 

Let py, Po, .-- » Py be the numbers of the states occupied by the individual 
particles (some of these numbers may be the same). For a system of bosons, 
the wave function ¥(&1, £2, ..., En) is given by a sum of products of the form 


Py (Eb, (Es) se do, (En), (61.1) 


with all possible permutations of the different suffixes p4, Pa, ... ; this-sum 
clearly possesses the required symmetry property. For example, for a 
system of two particles in different states (pı # pa), 


Pln E) = lo (Ee, (En) +e (Ed (EDV 2. (61.2) 


The factor 1/42 is introduced for normalization purposes; all the functions 
Wy, we, ... are orthogonal and are supposed normalized. 

In the general case of a system containing an arbitrary number N of 
particles, the normalized wave function is 


Unig = (CERREN E E ve (Edbn (Ee) ~ YowlEn), (61.3) 


where the sum is taken over all permutations of the different suffixes 
Pi, P2, .--; Py and the numbers N; show how many of these suffixes have the 
same value z (with & N; = N). In the integration of ||? over £1, £2, ..., Ex, 
all terms vanish except the squared modulus of each term of the sum;+ 
since the total number of terms in the sum (61.3) is evidently N!/Ni!No! ..., 
the normalization factor in (61.3) is obtained. 

For a system of fermions, the wave function ¢% is an antisymmetrical 
combination of the products (61.1). For a system of two particles we have 


P(E €2) = [Pp (E1) Yp (E2) — bp (£2) bp(E1)]/ 2. (61.4) 


For the general case of N particles, the wave function can be written in the 


t The integration over $ is conventionally understood in §§63-65 as including integration 
over the coordinates and summation over o 
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form of a determinant 


Pal) dold -ha (EN) 


Py (21) Pp, (Se) vee by, (EN) 
WN.N, = (61.5) 


bp (81) Pp (£2) nee Po (En) 


Here an interchange of two particles corresponds to an interchange of two 
columns of the determinant, as a result of which the latter changes sign. 
The following important result is a consequence of the expression 
(61.5). If among the numbers pı, po, ... two are the same, two rows of the 
determinant are the same, and it therefore vanishes identically. It will be 
different from zero only when all the numbers p}, po, ... are different. Thus, 
in a system consisting of identical fermions, no two (or more) particles can 
be in the same state at the same time. This is called Pauli’s principle (1925). 


§62. Exchange interaction 


The fact that Schrédinger’s equation does not take account of the spin 
of particles does not invalidate this equation or the results obtained by means 
of it. This 1s because the electrical interaction of the particles does not 
depend on their spins.t| Mathematically, this means that the Hamiltonian 
of a system of electrically interacting particles (in the absence of a magnetic 
field) does not contain the spin operators, and hence, when it is applied to the 
wave function, it has no effect on the spin variables. Hence Schrédinger’s 
equation is actually satisfied by each component of the wave function; in 
other words, the wave function of the system of particles can be written in 
the form of a product 


Héi £2) = x(01, o2, ...)A(¥1, r2, .--), 


where the function ¢ depends only on the coordinates of the particles and the 
function y only on their spins. We call the former a coordinate or orbital wave 
function, and the latter a spin wave function. Schrédinger’s equation essen- 
tially determines only the coordinate function ¢, the function y remaining 
arbitrary. In any instance where we are not interested in the actual spin of 
the particles, we can therefore use Schrédinger’s equation and regard as the 
wave function the coordinate function alone, as we have done hitherto. 
However, despite the fact that the electrical interaction of the particles 
is independent of their spin, there is a peculiar dependence of the energy 


+ This is true only so long as we consider the non-relativistic approximation. When 
relativistic effects are taken into account, the interaction of charged particles does depend 
on their spin. 
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of the system on its total spin, arising ultimately from the principle of 
indistinguishability of similar particles. 

Let us consider a system consisting of only two identical particles. By 
solving Schrédinger’s equation we find a series of energy levels, to each of 
which there corresponds a definite symmetrical or antisymmetrical co- 
ordinate wave function ¢(r}, r2). For, by virtue of the identity of the particles, 
the Hamiltonian (and therefore the Schrédinger’s equation) of the system 
is invariant with respect to interchange of the particles. If the energy levels 
are not degenerate, the function ¢(rj, rə) can change only by a constant 
factor when the coordinates r} and rg are interchanged; repeating this 
interchange, we see that this factor can only bet + 1. 

Let us first suppose that the particles have zero spin. The spin factor for 
such particles is absent altogether, and the wave function reduces to the 
coordinate function ¢(r,, r2), which must be symmetrical (since particles with 
zero spin obey Bose statistics). Thus not all the energy levels obtained by a 
formal solution of Schrédinger’s equation can actually exist; those to which 
antisymmetrical functions ¢ correspond are not possible for the system under 
consideration. 

The interchange of two similar particles is equivalent to the operation of 
inversion of the coordinate system (the origin being taken to bisect the line 
joining the two particles). On the other hand, the result of inversion is to 
multiply the wave function ¢ by (—1)!, where / is the orbital angular momen- 
tum of the relative motion of the two particles (see §30). By comparing 
these considerations with those given above, we cunclude that a system of two 
identical particles with zero spin can have only an even orbital angular mo- 
mentum. 

Next, let the system consist of two particles with spin $ (say, electrons). 
Then the complete wave function of the system (i.e. the product of the 
function (rj, r2) and the spin function x(01, o2)) must certainly be anti- 
symmetrical with respect to an interchange of the two particles. Hence, if 
the coordinate function is symmetrical, the spin function must be anti- 
symmetrical, and vice versa. We shall write the spin function in spinor form, 
i.e. as a spinor x4# of rank two, each of whose indices corresponds to the 
spin of one of the electrons. A symmetrical spinor (y4# = y#4) corresponds 
to a function symmetrical with respect to the spins of the two particles, 
and an antisymmetrical spinor (yè = — y#4) to an antisymmetrical func- 
tion. We know, however, that a symmetrical spinor of rank two describes a 
system with total spin unity, while an antisymmetrical spinor reduces to a 
scalar, corresponding to zero spin. 

Thus we reach the following conclusion. The energy levels to which there 
correspond symmetrical solutions (ri, re) of Schrédinger’s equation can 
actually occur when the total spin of the system is zero, i.e. when the spins 
of the two electrons are “antiparallel”, giving a sum of zero. The values of 
the energy belonging to antisymmetrical functions (ri, r2), on the other hand, 


t When there is degeneracy we can always choose linear combinations of the functions 
belonging to a given level, such that this condition is again satisfied. 
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require a value of unity for the total spin, i.e. the spins of the two electrons 
must be “‘parallel’’. 

In other words, the possible values of the energy of a system of electrons 
depend on their total spin. For this reason we can speak of a peculiar inter- 
action of the particles which results in this dependence. ‘This is called 
exchange interaction. It is a purely quantum effect, which entirely vanishes 
(like the spin itself) in the passage to the limit of classical mechanics. 

The following situation is characteristic of the case of a system of two 
electrons which we have discussed. To each energy level there corresponds 
one definite value of the total spin, 0 or 1. This one-to-one correspondence be- 
tween the spin values and the energy levels is preserved, as we shall see below 
(§63), in systems containing any number of electrons. It does not hold, 
however, for systems composed of particles whose spin exceeds }. 

Let us consider a system of two particles, each with arbitrary spin s. Its 
spin wave function is a spinor of rank 4s: 


half (2s) of whose indices correspond to the spin of one particle, and the other 
half to that of the other particle. The spinor is symmetrical with respect to 
the indices in each group. An interchange of the two particles corresponds 
to an interchange of all the indices A, u, ... of the first group with the indices 
p, a, ... of the second group. In order to obtain the spin function of a state of 
the system with total spin S, we must contract this spinor with respect to 
2s—S pairs of indices (each pair containing one index from A, 4, ... and one 
from p, c, ...), and symmetrize it with respect to the remainder; as a result 
we obtain a symmetrical spinor of rank 2S. However, the contraction of a 
spinor with respect to a pair of indices means, as we know, the construction 
of a combination antisymmetrical with respect to these indices. Hence, 
when the particles are interchanged, the spin wave function is multiplied 
by (1). 

On the other hand, the complete wave function of a system of two particles 
must be multiplied by (—1)?* when they are interchanged (i.e. by +1 for 
integral sand by —1 for half-integral s). Hence it follows that the symmetry 
of the coordinate wave function with respect to an interchange of the particles 
is given by the factor (—1)8, which depends only on S. Thus we reach the 
result that the coordinate wave function of a system of two identical particles 
is symmetrical when the total spin is even, and antisymmetrical when it is 
odd. 

Recalling what was said above concerning the relation between interchange 
of the particles and inversion of the coordinate system, we conclude also 
that, when the spin S is even (odd), the system can have only an even (odd) 
orbital angular momentum. 

We see that here also a certain dependence is revealed between the possible 


values of the energy of the system and the total spin, but this dependence is 
not necessarily one-to-one. The energy levels to which there correspond 
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symmetrical (antisymmetrical) coordinate wave functions can occur for any 
even (odd) value of S. 

Let us calculate how many different states of the system there are with even 
and odd S. The quantity S takes 2s+1 values: 2s, 2s—1,...,0. For any 
given S there are 2S +1 states differing in the value of the z-component of the 
spin ((2s-+ 1)* different states altogether). Lets be ‘ntegral. Then, among 
the 2s+1 values of S, s+1 are even and s odd. The total number of states 
with even S is equal to the sum 

oF aS) = ole 
the remaining s(2s+1) states have odd S. Similarly, we find that, when s is 
half-integral, there are s(2s+-1) states with even values of S and (s+1)(2s+1) 
with odd values. 


PROBLEMS 


PROBLEM 1. Determine the exchange splitting of the energy levels of a system of rwo 
electrons, regarding the interaction of the electrons as a perturbation. 


SOLUTION. Let the particles be (when their interaction is neglected) in states with orbital 
wave functions ¢:(r) and ¢2(r). The states of the system with total spin S = 0 and S = 1 
correspond to symmetrized and antisymmetrized products respectively: 


$ = —4s(ei)soles) + du(ea)ba(es)} 
/2 


The mean value of the operator of the interaction U(re—ri) of the particles in these states 
is A+J, where 


A= | | U'\d3(r1)|?|Po(re)|? dV dV2, 


J= | | Cdi(risi*(r2)be(ra)gar(ni) dV: dV, 


the latter being called the exchange integral. Omitting the additive constant A, which is not 
an exchange term, we therefore find the level shifts AEo = J, AE; = —J (where the suffix 
indicates the value of S). These quantities can be represented as the eigenvalues of the spin 
exchange operatort 


Pexcn = —$J(14+48; . 80); (1) 


the eigenvalues of the product sı . s2 are derived in §55, Problem 2. 

If the electrons belong to different atoms, for example, the exchange integral decreases 
exponentially with increasing distance R between the atoms. Itis clear from the form of the 
integrand that this integral is determined by the “overlap’’ of the wave functions of the states 
¢i(r,) and ¢,(r.); using the asymptotic law of decrease of the wave functions of states of a 
discrete spectrum (cf. (21.6)), we find that 


J~ eter R, icy = 4/(2m|E|)/h, ro = y (2m Eal)/h, 


where E1 and E? are the energy levels of the electron in the two atoms. 
PROBLEM 2. The same as Problem 1, but for a system of three electrons. 


j SOLUTION. Using formula (1), Problem 1, we can write the operator of pairwise exchange 
interaction in a systern of three electrons as 


Pexch = — E Jal} +25a - 8), (1) 


t First used by Dirac. 
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where the summation ts over pairs of particles 12, 13 and 23. The matrix elements of the 

~ . a L - 
operators Sa . §» between states with different values of the pair of numbers og, op are given 
by formulae (55.6) as 


C3, BiSa. solk. 22=4. C3- kisa. sol} -p= l, ¢4,-3]s0-Se|—3, 3D =}. 


We first determine the energy corresponding to the greatest possible velue of the total-spin 
component Ms = o1 +02+03, viz. Ms = 3/2. This gives the energy of the state with total 
spin S = 3/2. On calculating the corresponding diagonal matrix element of the operator 


(1), we find 


sie 


SE3;2 = —(Ji2+ Jis + Jos). 


Next we take states with Ms = 34. This value can occur in three ways, depending on which 
of the numbers a1, o2, o3 is —4 (the other two being 4). Thus for these states we should 
have a secular equation of the third degree. The calculation can, however, be simplined 
immediately by noting that one of the roots of this equation must correspond to the energy 
already found for the state with S = 3/2, and the secular equation must therefore have the 
factor AE —AE3;2. In this way the calculation of the free term in the cubic equation can 
be avoided.t 


The leading terms of the equation are found to be 


(AE + (J12 +413 + J23)(SE)* + [J 12413 +J12J23 + J13J23— 


—(J12?+ J132 + Jeos*)JAE+ ... = 0. 


Dividing by 4E'+Ji24+ Ji3+Je3, we find the two energy levels corresponding to states with 
spins S = 3: 


AE = + [122+ + Jos®) — Jii- Si2de3— JisJe3}!. 


Thus there are three energy levels, in accordance with the calculation in §63, Problem 1. 
PROBLEM 3. In which states can the *Be nucleus decay into two a-particles ? 


SOLUTION., Since the &-particle has no spin, a system of two a-particles can only have an 
even orbital angular momentum (equal to the total angular momentum), and its states are 
even.” The decay in question is therefore possible only from even states of the ®Be nucleus 
with even total angular momentum. 


§63. Symmetry with respect to interchange 


By considering a system composed of only two particles, we have been able 
to show that its coordinate wave functions (rj, r2) for the stationary states 
must be either symmetrical or antisymmetrical. In the general case of a sys- 
tem of an arbitrary number of particles, the solutions of Schrédinger’s 
equation (the coordinate wave functions) need not necessarily be'either sym- 
metrical or antisymmetrical with respect to the interchange of any pair of 
particles, as the complete wave functions (which include the spin factor) 
must be. This is because an interchange of only the coordinates of two par- 
ticles does not correspond to a physical interchange of them. The physical 
identity of the particles here leads only to the fact that the Hamiltonian of the 
system is invariant with respect to the interchange of the particles, and hence, 
if some function is a solution of Schrédinger’s equation, the functions ob- 
tained from it by various interchanges of the variables will also be solutions. 

Let us first of all make some remarks regarding interchanges in general. 





+ This device is particularly useful in similar calculations for systems with a larger number 
of particles. 
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In a system of N particles, N! different permutations in all are possible. If 
we imagine all the particles to be numbered, each permutation can be 
represented by a definite sequence of the numbers 1, 2, 3,.... Every such 
sequence can be obtained from the natural sequence 1, 2, 3, ... by successive 
interchanges of pairs of particles. The permutation is called even or odd, 
according as it is brought about by an even or odd number of such inter- 
changes. We denote by P the operators of permutations of N particles, and 
introduce a quantity dp which is +1 if Ê is an even permutation and - 1 
if it is odd. If ¢ is a function symmetrical with respect to all the particles, 
we have 


P$ =ġ, 
while, if ¢ is antisymmetrical with respect to all the particles, then 


Pi sird 


From an arbitrary function ¢(f, fo, -.., ry), we can form a symmetrical 
function by the operation of symmetrization, which can be written 


aym = constant x Z P, (63.1) 


where the summation extends over all possible permutations. The formation 
of an antisymmetrical function (an operation sometimes called alternation) 
can be written as 

Pant = Constant x S SpPd. (63.2) 


Let us return to considering the behaviour, with respect to permutations, 
of the wave functions ¢ of a system of identical particles.t| The fact that the 
Hamiltonian A of the system is symmetrical with respect to all the particles 
means, mathematically, that H commutes with all the permutation operators 
PÊ. These operators, however, do not commute with one another, and so 
they cannot be simultaneously brought into diagonal form. This means that 
the wave functions ¢ cannot be so chosen that each of them is either symmetri- 
cal or antisymmetrical with respect to all interchanges separately.{ 

Let us try to determine the possible types of symmetry of the functions 
P(X, Fo .-., ry) Of N variables (or of sets of several such functions) with 
respect to permutations of the variables. The symmetry must be such that 
it cannot be increased, i.e. such that any additional operation of symmetri- 
zation or alternation, on being applied to these functions, would reduce them 
either to linear combinations of themselves or to zero identically. 

We already know two operations which give functions with the greatest 





t From the mathematical point of view, the problem is to find irreducible representations 
of the permutation group. A detailed account of the mathematical theory of permutation 
(or symmetry) groups is given by H. Weyl, The Theory of Groups and Quantum Mechanics, 
mean oie ibe Hamermesh, Group Theory and its Application to Physical 

roblems, rergamon, London, 1962: I. G. K - 

UETA el ee T ee I. G. Kaplan, Symmetry of Many-Electron Systems, 

f Except for a system of only two particles, where there 1s a single interchange operator 
which can be brought into diagonal form simultaneously with the Hamiltonian. i 
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possible symmetry: symmetrization with respect to all the variables, and 
alternation with respect to all the variables. These operations can be general- 
ized as follows. 

We divide the set of all the N variables r}, ro, ... , ry (or, What is the same 
thing, the suffixes 1, 2, 3, ... , N) into several sets, containing Nj, Np, ... ele- 
ments (variables); N,+N,+... = N. This division can be conveniently 
shown by a diagram (known as a Young diagram) in which each of the num- 
bers N,, Na, -.. is represented by a line of several cells (thus, Fig. 21 gives a 
diagram of the divisions 6+4+4+3+3+1 +1 and 74+5+4+54+3+41+1 for 
N = 22); one of the numbers 1, 2, 3, ... is to be placed in each square. If 
we place the lines in order of decreasing length (as in Fig. 21), the diagram 
contains not only successive horizontal rows, but also vertical columns. 


Fic. 21 


Let us symmetrize an arbitrary function ¢(r1, re, ..., rv) with respect to the 
variables in each row. The alternation operation can then be performed only 
with respect to the variables in different rows; alternation with respect to a 
pair of variables in the same row clearly gives zero identically. 

Having chosen one variable from each row, we can, without loss of gener- 
ality, regard them as being in the first cells in each row (after symmetrization, 
the order of the variables among the cells in each row is immaterial); let us 
alternate with respect to these variables. Having then deleted the first column, 
we alternate with respect to variables chosen one from each row in the thus 
“curtailed” diagram; these variables can again be regarded as being in the 
first cells of the “‘curtailed” rows. Continuing this process, we finally have 
the function first symmetrized with respect to the variables in each row and 
then alternated with respect to the variables in each column. After alternation, 
of course, the function in general ceases to be symmetrical with respect to 
the variables in each row. The symmetry is preserved only with respect to 
the variables in the cells of the first row which project beyond the other rows. 

Having distributed the NV variables in various ways among the rows of a 
Young diagram (the distribution among the cells in each row is immaterial), 
we thus obtain a series of functions, which are transformed linearly into one 
another when the variables are permuted in any manner.t However, it must 


+ It would be possible to perform the symmetrization and alternation in the reverse order: 
to alternate with respect to the variables in each column, and then to symmetrize with respect 
to those in the rows. This, however, would give effectively the same thing, since the functions 
obtained by the two methods are linear combinations of one another. 
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be emphasized that not all these functions are linearly independent; the 
number of independent functionsis in general less than the number of possible 
distributions of the variables among the rows of the diagram. We shall not 
pause here, however, to discuss this more closely. 

Thus any Young diagram determines some type of symmetry of functions 
with respect to permutations. By constructing all the possible Young dia- 
grams (for a given N), we find all possible types of symmetry. This amounts 
to dividing the number JN in all possible wavs into a sum of smaller terms, 
including the number AN itself; thus for N = 4 the possible partitions are 
4,341, 24+2,2+14+1, 1+141+1. 

To each energy level of the system we can make correspond a Young dia- 
gram which determines the permutational symmetry of the appropriate 
solutions of Schrédinger’s equation; in general, several different functions 
correspond to each value of the energy, and these are transformed linearly into 
each other by permutations. The existence of this “permutational de- 
generacy”’ is related to the fact that the operators Ê each commute with the 
Hamiltonian but not with one another (see the middle of §10). However, it 
must be emphasized that this does not signify any additional physical 
degeneracy of the energy levels. All these different coordinate wave functions, 
multiplied by the spin functions, enter into a single definite combination— 
the complete wave function—which satisfies (according to the spin of the 
particles) the condition of symmetry or antisymmetry. 

Among the various types of symmetry there are always (for any given NV) 
two to each of which only one function corresponds. One of these cor- 
responds to a function symmetrical with respect to all the variables, and the 
other to one which is similarly antisymmetrical; in the first case, the Young 
diagram consists of a single row of N cells, and in the second case of a single 
column. 

Let us now consider the spin wave functions x(01, 02, ..., gy). Their kinds 
of symmetry with respect to permutations of the particles are given by the 
same Young diagrams, with the components of the spins of the particles 
taking the part of variables. There arises the question of what diagram must 
correspond to the spin function for a given diagram of the coordinate func- 
tion. Let us first suppose that the spin of the particles is integral. Then the 
complete wave function ù must be symmetrical with respect to all the particles. 
For this to be so, the symmetry of the spin and coordinate functions must be 
given by the same Young diagram, and the complete wave function ¢ is 
expressed as definite bilinear combinations of the two; we shall not here pause 
to examine more closely the problem of constructing these combinations. 

Next, suppose the spin of the particles to be half-integral. Then the com- 
plete wave function must be antisymmetrical with respect to all the particles. 
It can be shown that, for this to be so, the Young diagrams for the coordinate 





t The independent functions that are transformed into linear combinations of one another 
form the basis of an irreducible representation of the permutation group. Their number is 


the dimension of the representation. For particles with spin $ the number is derived in 
Problem 1 below. 
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and spin functions must be in dual relation, i.e. obtained from each other by 
interchanging rows and columns (as in the two diagrams shown in Fig. 21). 

Let us consider in more detail the important case of particles with spin 4 
(electrons, for instance). Each of the spin variables o}, o9, ... here takes only 
the two values +}. Since a function antisymmetrical with respect to any 
two variables vanishes when these variables take the same value, it is clear 
that the function y can be alternated only with respect to pairs of variables; 
if we alternate with respect to even three variables, two of them must always 
take the same value, so that we have zero identically. 

Thus, for a system of electrons, the Young diagrams for the spin functions 
can contain columns of only one or two cells (i.e. only one or two rows); in 
the Young diagrams for the coordinate functions, the same is true of the 
number of columns. The number of possible types of permutational sym- 
metry for a system of N electrons is therefore equal to the number of possible 
partitions of the number N into a sum of ones and twos. When N is even, 
this number is 4M +1 (partitions with 0, 1, ... , $V twos), while if NV is odd 
it is $(N+1) (partitions with Q, 1, ..., {N —1) twos). Thus, for instance, 
Fig. 22 shows the possible Young diagrams (coordinate and spin) for N = 4. 


AP E 


$=2 § =7 $20 
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It is easy to see that each of these types of symmetry (i.e. each of the 
Young diagrams) corresponds to a definite total spin S of the system of 
electrons. We shall consider the spin functions in spinor form, i.e. as spinors 
xô of rank N, whose indices (each of which corresponds to the spin of 
an individual particle) will be the variables that are arranged in the cells of 
the Young diagrams. Let us examine the Young diagram consisting of two 
rows with N} and N, cells (N, +N, = N, and N, > N,). In each of the first 
N, columns there are two cells, and the spinor must be antisymmetrical with 
respect to the corresponding pairs of indices. With respect to the indices in 
the last n = Ny—WNe cells in the first row, however, it must be symmetrical. 
As we know, such a spinor of rank N reduces to a symmetrical spinor of rank 
n, to which there corresponds a total spin S = }n. Returning to the Young 
diagrams for the coordinate functions, we can say that the diagram with 7 
rows each of one cell corresponds to a total spin S = $n. For even N, the 
total spin can take integral values from 0 to 4N, while for odd N ft can take 


half-integral values from } to 4$N, as it should. 
We emphasize that this one-to-one correspondence between the Young 
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diagrams and the total spin holds only for systems of particles with spin }; 
we have seen this, for a system of two particles, in the previous section. Fora 
system of .V particles with spin s, the spin wave function is made up of a 
product of N symmetrical spinors of rank 2s, i.e. is a spinor of rank 2Ns. If 
this spinor is symmetrized according to a particular Young diagram of NV 
cells, we can usually construct from the independent components of the 
symmetrized spinor several sets of linear combinations, each set corres- 
ponding to a different total spin S of the system. 

In the same way as the Young diagram for the spin functions of particles 
with spin 3 Cannot contain columns of more than two cells, so for particles 
with any spin s the columns cannot contain more than 2s + 1 cells. 

If the number N of particles in the system is an integral multiple of 2s + 1, 
the possible Young diagrams include a rectangle with 2s+1 cells in each 
column. This corresponds to one definite value of the total spin, S = 0. 
Hence we can conclude that the same value of S corresponds to any two 
(spin) Young diagrams which can be fitted together to form a rectangle of 
height 2s+1.¢ This is a simple consequence of the fact that the addition 
of two angular momenta can give zero only if they have the same absolute 
magnitude. 

To conclude this section, let us return to the fact already mentioned in the 
footnote at the end of §20 that, for a system of several identical particles, we 
cannot assert that the wave function of the stationary state of lowest energy is 
without nodes. We can now amplify this statement and elucidate its origin. 

The wave function (that is, the coordinate function), if it has no nodes, 
must certainly be symmetrical with respect to all the particles; for, if it were 
antisymmetrical with respect to the interchange of any pair of particles 1, 2, it 
would vanish for rı = re. If, however, the system consists of three or more 
electrons, no completely symmetrical coordinate wave function is possible 
(the Young diagram of the coordinate function cannot have rows with more 
than two cells). Thus, although the solution of Schrédinger’s equation 
which corresponds to the lowest eigenvalue is without nodes (by the theorem 
of the variational calculus), this solution may be physically inadmissible; 
the smallest eigenvalue of Schrédinger’s equation will not then correspond 
to the normal state of the system, and the wave function of this state will in 
general have nodes. For particles with a half-integral spin s, this situation 
occurs in systems with more than 2s+1 particles. For systems of bosons, a 
completely symmetrical coordinate wave function is always possible. 


PROBLEMS 
PROBLEM 1. Determine the number of energy levels with different values of the total 
spin S, for a system of N particles with spin }. 


~~ * 
t For example, the two diagrams (for s = 1) 


i] 
=m al ww S 
lp 4 i | l 


The continuous and broken lines show the complementary diagrams., 
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SOLUTION. A given value of the projection of the total spin of the system, Ms = È o, can 
be obtained in 


f(Ms) = NYGN + Ms) (AN — Ms)! 


ways, with 4N + Ms particles taken to have o = } and the remainder o = —}. To each 
energy level with a given S, there correspond 2S+1 states with Ms = S, S—1, ..., —S. 
Hence it is easy to see that the number of different energy levels with a given value of S is 


n(S) = {(S)-f(S+1) = NWU28S+D/(AN+S+D1GN—S)!. 
The total number of different energy levels is 
n = Zn(S) = f (0) = NGN) 
for even N, and 
n =f ($) = NYGN+YIGN-P! 


for odd N. 

PROBLEM 2, Find the values of the total spin S that occur for various types of symmetry 
of the spin functions of a system of two, three or four particles with spin 1. 

SOLUTION. For two particles, the correspondence is established by the fact that the factor 
by which the spin function is multiplied when the particles are interchanged must be (—1)**—-5 
(see the end of §62). For particles with spin s = 1 this gives 


kone (b) H 
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(1) 


The Young diagrams for a system of three particles are obtained by adding to the diagrams 
(1) one cell in every possible way. The result may be written as the symbolic equations 


(b) 


The values of S are shown beneath each diagram, and the values of the total spin of the 
system of three particles (the diagrams on the right) are found from the spins of the two- 
particle and one-particle systems (the diagrams on the left) by the rule of addition of angular 
momenta.t The distribution of the resulting values of S among the diagrams on the right is 
established by noting that diagram (c) (a column of three cells) corresponds to S = 0, and 
(b) therefore to the remaining values 1 and 2 in the second equation, while (a) belongs to the 


+ The repetition of 1 beneath the right-hand diagrams occurs because this value of the 
angular momentum comes firstly from adding the angular momenta O and 1, and secondly 
from adding 2 and 1. 
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values 1 and 3 that are left after (b) has been labelled in the first equation: 


et oy T 
S-i o3 (2) 
S=0 


S=lorZ 


The Young diagrams for a system of four particles are obtained by adding one cell to the 
diagrams (2), with the condition that no column should contain more than three cells: 


(a) (b) 


Litt) + [th 


ee ee a 
1,3 | 
O 2,2,84 


(b) CE3 (d) 


Fo H p 


ae 
8 


O | 


Diagram (c) can be added to (a) in (1) to form a rectangle with three-cell columns, and there- - 
fore corresponds to the same values S = 0, 2. The values of S for diagram (b) are found 
from the remainder of the second equation, and then those for (a) from the remainder of the 
first equation: 


tom (ota| aes tet tT (d) 
$=0,2o0r4 
$=! 


S =1,2 or 3 S$=0 or2 


§64. Second quantization. The case of Bose statistics 


In the theory of systems consisting of a large number of identical particles, 
there is a widely used method of considering the problem, known as second 
quantization. ‘This method is especially necessary in relativistic theory, 


where we have to deal with systems in which the number of particles is itself 
variable.t 


. + The method of second quantization was developed by P. A. M. Dirac (1927) for photons 
in radiation theory, and later extended to fermions by E. Wigner and P. Jordan (1928). 
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Let (é), e(&), ... be some complete set of orthogonal and normalized 
wave functions of stationary states of a single particle.t These may be states 
of a particle in some arbitrarily chosen external field, but are usually taken to 
be simply plane waves, 1.e. the wave functions of a free-particle having definite 
values of the momentum (and spin projection). In order to make the spectrum 
of states discrete, we shall consider the motion of particles in a large but finite 
region, for which the eigenvalues of the momentum components form a 
discrete series, the intervals between adjacent values being inversely pro- 
portional to the linear dimensions of the region and tending to zero as these 
increase. 

In a system of free particles, the particle momenta are separately con- 
served. The occupation numbers of the states are therefore also conserved, 
i.e. the numbers Nj, No, ... which show how many particles are in each of the 
States 1, Y2, .... In a system of interacting particles, the momentum of each 
particle is not conserved, and so the occupation numbers are not conserved. 
For such a system we can consider only the probability distribution of the 
various values of the occupation numbers. Let us seek to construct a 
mathematical formalism in which the occupation numbers (and not the 
coordinates and spin projections of the particles) play the part of independent 
variables. 

In this formalism, it is convenient to use the Dirac notation (see the end of 
$11), taking N,,N2, ... as quantum numbers defining the state. The states 
corresponding to the wave functions (61.3) and (61.5) will be denoted by 
|N1,N2, ...>. The coordinate and spin variables of the particles are not 
shown explicitly. 

In accordance with this choice of the independent variables, the operators 
of the various physical quantities (including the Hamiltonian of the system) 
must be formulated in terms of their action on functions of the occupation 
numbers. Such a formulation can be obtained on the basis of the usual matrix 
representation of operators. The operator matrix elements must be con- 
sidered in relation to the wave functions of the stationary states of a system 
of non-interacting particles. Since these states can be described by speci- 
fying definite values of the occupation numbers, this will also show the nature 
of the action of the operators on these variables. j 

Let us first consider systems of particles obeying Bose statistics. Let f Va 
be the operator of some quantity pertaining to the ath particle, i.e. acting only 
on functions of the variables a. We introduce the operator 


Fo = 5 fe, (64.1) 


which is symmetrical with respect to all the particles (the summation being 
over all particles), and determine its matrix elements with respect to the wave 
functions (61.3). First of all, it is easy to see that the matrix elements will 


+ As in §61, € denotes the set of the coordinates and the spin projection a of the particle, 
and integration with respect to £ is taken to mean integration over the coordinates and sum- 


mation over o. 
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be different from zero only for transitions which leave the numbers Nj, Nas --- 
unchanged (diagonal elements) and for transitions where one of these numbers 
is increased, and another decreased, by unity. For, since each of the operators 
f™, acts only on one function in the product Yn (Erp, (Eo) --- bp ( Ew)» 
its matrix elements can be different from zero only for transitions whereby 
the state of a single particle is changed; this, however, means that the number 
of particles in one state 1s diminished by unity, while the number in another 
state is correspondingly increased. The calculation of these matrix elements 
is in principle very simple; it is easier to do it oneself than to follow an account 
of it. Hence we shall give only the result of this calculation. The non- 
diagonal elements are 


| (Ni, Ne- UFÐIN -1, No = f Min (Nix). (64.2) 


We shall indicate only those suffixes with respect to which the matrix element 
is non-diagonal, omitting the remainder for brevity. Here f),, is the matrix 
element 


Lt Ef O Sa = f WOA 66 (64.3) 


since the operators f (1), differ only in the naming of the variables on which 
they act, the integrals (64.3) are independent of a, which is therefore 
omitted. The diagonal matrix elements of F™ are the mean values of the 
quantity F® in the states wy y,... Calculation gives 


FO F f%,,N,. (64.4) 


We now introduce the operators å; which play a leading part in the method 
of second quantization; they act, not on functions of the coordinates, but on 
functions of the occupation numbers. By definition, the operator 4; acting on 
the function |N,, N2, ...> decreases the value of the variable N; by unity, 
and at the same time it multiplies the function by VN;:t 


âil Ni, No, ...3 Ni, 2 = VIN, Nə, TET N;—1, tee (64.5) 


We can say that the operator â; diminishes by one the number of particles in 
the 7th state; it is therefore called a particle annihilation operator. It can be 
represented in the form of a matrix whose only non-zero element is 


«Ni — llai| Ni = V Ni. (64.6) 


The operator 4;+ which is the Hermitian conjugate of â; is, by definition 
(see (11.9)), represented by a matrix whose only non-zero element is 


(Nila; ti Ni- 1> = <N; ~ Ilai Ni>* = a/ Ni. (64.7) 


tHere we use the notation â|m> with the natural sense of th 


i e result of the operator 4 acting on 
the state wave function |m> P g 
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This means that, when acting on the function @(Nj, N2, ...), it increases the 
number N; by unity: 


&|Ni, No, ..., Ni, -> = V(NAL)IN:, No, 0. N;+1, ...>. (64.8) 


In other words, the operator @;* increases by one the number of particles in 
the zth state, and is therefore called a particle creation operator. 

The product of the operators 4;+d;, acting on the wave function, must 
multiply it by a constant simply, leaving unchanged all the variables Ny, N2, 
...: the operator á; diminishes N; by unity, and å+ then restores it to its 
original value. Direct multiplication of the matrices (64.6) and (64.7) shows 
that 4; +d; is represented, as we should expect, by a diagonal matrix whose 
diagonal elements are N;. We can write 


i tds = Nj. (64.9) 


Similarly, we find that 
áit = Na+. (64.10) 


The difference of these equations gives the commutation rule for the 
operators å; and 4,*: 


áit — it; = 1. (64.11) 


The operators with 2 and k different act on different variables (N; and Nx), 
and cominute: 


Gay —aya, = 0, aap +—Gy td; = 0 (i Æ k). (64.12) 


From the above properties of the operators d;, å;* it is easy to see that the 
operator 


Fo = AS nda, (64.13) 


is the same as the operator (64.1). For all the matrix elements calculated from 
(64.6), (64.7) are the same as the elements (64.2), (64.4). This is a very 
important result. In formula (64.13), the quantities f®,, are simply 
numbers. Thus we have been able to express an ordinary operator, acting on 
functions of the coordinates, in the form of an operator acting on functions 
of new variables, the occupation numbers Nj. 

The result which we have obtained is easily generalized to operators of 
other forms, Let 


fo = 2 fav» (64.14) 


ae f 2) is the operator of a physical quantity pertaining to two particles 
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at once, and hence acts on functions of é, and €,. Similar calculations show 
that this operator can be expressed in terms of the operators 4,, d;*+ by 


as LE „KIRIS Ollm ai âk *dmay, (64.15) 


where 


Cik| f Dlm = SS WEEE Enl) dédé, 


The generalization of these formulae to operators of any other form sym- 
metrical with respect to all the particles (of the form $Ð = E f (dabo, etc.) is 
obvious. 

These formulae can be used to express, in terms of the operators å; and 4;+, 
the Hamiltonian of the physical system of N identical interacting particles 
that is being considered. The Hamiltonian of such a system is, of course, 
symmetrical with respect to all the particles. In the non-relativistic approxi- 
mation,t it is independent of the spins of the particles, and can be represented 
in a general form as follows: 


= (1) (2) (3) 
H py Ao + 2 U(r re) + 2U (tarp re) + e. (64.16) 


Here H), is the part of the Hamiltonian which depends on the coordinates 
of the ath particle only: 


HO, = —(h?/2m)A a+ U%r,), (64.17) 


where U(r.) is the potential energy of a single particle in the external field. 
The remaining terms in (64.16) correspond to the mutual interaction energy 
of the particles; the terms depending on the coordinates of two, three, etc. 
particles have been separated. 

This representation of the Hamiltonian enables us to apply formulae 
(64.13), (64.15) and their analogues directly. Thus 


A= EH ,d+4,+4 5 <ik| U| limy: *âk*âmőr+ .... (64.18) 


This gives the required expression for the Hamiltonian in the form of an 
operator acting on functions of the occupation numbers. 

For a system of non-interacting particles, only the first term in the expres- 
sion (64.18) remains: 


H= 2H ,4*4,. (64.19) 


If the functions p; are taken to be the eigenfunctions of the Hamiltonian 
H® of an individual particle, the matrix H, is diagonal, and its diagonal 
elements are the eigenvalues «; of the energy of the particle. Thus 


Å = py câ â; 


+ In the absence of a magnetic field. 
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replacing the operator d@;*d; by its eigenvalues (64.9), we have forthe energy 
levels of the system the expression 


E = = EN 


a trivial result which could have been foreseen. 
The formalism which we have developed can be put in a more compact 
form by introducing the w-operatorst 


$() = F pE PO = VpOae, (64.20) 


where the variables é are regarded as Poe By what has been said 
above concerning the operators d;, d;*, it is clear that the operator ý de- 
creases the total number of particles in the system by one, while + increases 
it by one. 

It is easy to see that the operator ý +(£o) creates a particle at the point £o. 
For the result of the action of the operator d;+ is to create a particle in a state 
with wave function pi(g). Hence it follows that the result of the action of 
the operator i+(o) is to create a particle in a state with wave function 
D wi*(E)di(Eo) = (E— Eo), which corresponds to a particle with definite 
values of the coordinates (and spin). Here we have used formula (5.12).] 

The commutation rules for the % operators are obtained at once from those 
for the operators d;, d;*: 


HERE) — HE) P(E) = 0, (64.21) 
HEIHE) FEMS = E pléite) = (E-E). (64.22) 


The second-quantized operator F® can be written by means of the 
ys operators in the form 


Fo = f $e f mde dé (64.23) 


where it is understood that the operator f “) acts on functions of the para- 
meters ¿in ¥(£). For, substituting ý and ý+ in the form (64.20) and using 
the definition (64.3), we return to (64.13). Similarly, (64.15) becomes 


FO = 4 fb Ob-E)F MEME dE de. (64.24) 


+ Note the analogy between (64.20) and the expansion 


w= Lay 


A no oat : 2 
of anv wave function in terms of a complete set of functions. Here it 1s re-quantized’’, and 
this is the reason for the term second quantization method. 
t &(€—€,) conventionally denotes the product 


8(x—%9)8(—F0)8(F— 20) 840 k 
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In particular, the Hamiltonian of the system, expressed in terms of the 
y operators, is 


E | -TOAND ORONG] dg 
+ EEE P(E )UQLE, EV G(EVEE) dé dé’ +... . (64.25) 


The operator £+(£)¥(€), constructed from the Ņ operators by analogy 
with the product &* which determines the probability density for a particle 
in a state with wave function w, is called the particle density operator. The 
integral 


N = f dtd dé (64.26) 


represents in the second-quantization formalism the operator of the total 
number of particles in the system. For, substituting the % operators in the 
form (64.20) and using the normalization and the orthogonality of the wave 
functions, we have 


N = Data. 


Each term in this sum is the operator of the number of particles in the ith 
state; according to (64.9), its eigenvalues are equal to the occupation numbers 
Nj, and the sum of all these numbers is the total number of particles in the 
system. 

Lastly, if the system consists of bosons of various kinds, operators @ and â+ 
for each kind of particle must be defined in the second quantization method. 
It is evident that operators pertaining to particles of different kinds commute. 


§65. Second quantization. The case of Fermi statistics 


The basic theory of the method of second quantization remains wholly 
unchanged for systems of identical fermions, but the actual formulae for the 
matrix elements of quantities and for the operators å; are naturally different. 

The wave function Yn, ... now has the form (61.5). Because of the 
antisymmetry of this function, the question of its sign arises first of all. 
This question did not arise in the case of Bose statistics, since, because of 
the symmetry of the wave function, its sign, once chosen, was preserved 
under all permutations of the particles. In order to make definite the sign 
of the function (61.5), we shall agree to choose it as follows. We number 
successively, once and for all, all the states hi. We then complete the rows 





t For systems containing a specified number of particles these statements are trivial, as 
are the properties of the Hamiltonian (64.19) of a system of free particles. Their generalization 
in the relativistic theory, however, yields new results that are by no means trivial (cf. ROT, 


§11) 
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of the determinant (61.5) so that always 
Pi < P2 < Pa < o < py, (65.1) 


whilst in the successive columns we have functions of the different variables 
in the order &, &,..., &y. No two of the numbers Pi» Pas -- can be equal, 
since otherwise the determinant would vanish. In other words, the occupa- 
tion numbers N; can take only the values 0 and 1. 

Let us again consider an operator of the form (64.1), F® = E f Da. As 
in §64, its matrix elements will be non-zero only for transitions where all the 
occupation numbers remain unchanged and for those where one occupation 
number (N;) is diminished by unity (becoming zero instead of one) and an- 
other (N) is increased by unity (becoming one instead of zero). We easily 
find that, for: < kà, 


Cia Ox| FO On 1p >= fP 128 kn, (65.2) 


where by O; 1; we signify N; = 0, N: = 1 and the symbol X(A, /) denotes 
the sum of the occupation numbers of all states from the Ath to the /th:T 


For the diagonal elements we obtain our previous formula (64.4): 
Fa = D Sf PaNNa- (65.3) 


In order to represent the operator F in the form (64.13), the operators 
á; must be defined as matrices whose elements are 


<Oilai|l:> = (1glagt|Og> = (— 1)" i. (65.4) 


On multiplying these matrices, we find, for k > 1, 
Cli, Oxlartar|Oi 1Y = <li, Orlas +0: O4><0;, Ox|@x]0z, 1xY 


=(—1 ya, ih 1)*4. i-N)+2i4+1, k-1) 


or 


Cli, Oxlar *ar|Oi, 1g) = (— 1) k-n, (65.5) 


Ifi = k, the matrix of 4,*+4, is diagonal, and its elements are unity for N; = 1, 
and zero for N; = 0; this can be written 


ata; = N,. (65.6) 


+ For : > k the exponent in (65.2) becomes I (k+1, 1—1). The sum must be taken as 
zero when f = Rki1. 
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On substituting these expressions tn (64.13), we in fact obtain (65.2), (65.3). 
Multiplying d;+, d, in the opposite order, we have 


Cli, Oklagar tlO 1e) = Che, Orlarili, Le><1i, Telae*]O;, 14) 
om (— 1)34. 4145041, M21, -11 


or 


Cli Ox) axa; +|0;, l> = —(— 1). k-1), (65.7) 


Comparing (65.7) with (65.5), we see that these quantities have opposite signs, 
Le, 
d dg + G,a;* =) (i # k). 


For the diagonal matrix 4,d4;*, we find 
d,a,t+ = 1—N;. (65.8) 
Adding this to (65.6), we obtain 


A A 


ái; +å å = l. 
Both the above equations can be written in the form 
Ĝiĝkt tH åkt Âi = Say. (65.9) 


On carrying out similar calculations, we find for the products â, the re- 
lations 
Mdyt dyads = 0, (65.10) 


and in particular åå; = 0. 

Thus we see that the operators å; and d, (or d,*) for í + k anticommute, 
whereas in the case of Bose statistics they commuted with one another. This 
difference is perfectly natural. In the case of Bose statistics, the operators 
â; and d, were completely independent; each of the operators â; acted only 
on a single variable N,, and the result of this action did not depend on the 
values of the other occupation numbers. In the case of Fermi statistics, 
however, the result of the action of the operator â; depends not only on the 
number V, itself, but also on the occupation numbers of all the preceding 
States, as we see from the definition (65.4). Hence the action of the various 
operators ĝ;, d, cannot be considered independent. 

The properties of the operators é;, @;* having been thus defined, all the 
remaining formulae (6+.13)-(64.18) remain valid. The formulae (64.23)- 
(64.25), which express the operators of physical quantities in terms of the 
y-operators defined by (64.20), also hold good. The commutation rules 
(64.21), (64.22), however, are now replaced by 


PENE + MEDE) = E-E’), (65.11) 
HEJME + MEME) = 0. (65.12) 
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If the system consists of particles of different kinds, second quantization 
operators must be defined for each kind of particle (as already mentioned at 
the end of §64). Operators belonging to bosons and fermions commute; 
those belonging to different fermions may formally be regarded as either 
commutative or anticommutative within the limits of non-relativistic theory. 
On either assumption the results obtained by means of the.second quantiza- 
tion method are the same. 

However, with a view to later applications in the relativistic theory, which 
allows different particles to be transformed into one another, we should 
assume that the creation and annihilation operators for different fermions 
anticommute. This becomes evident if we regard as ‘‘different’’ particles 
two different internal states of a single complex particle. 


CHAPTER X 


THE ATOM 


§66. Atomic energy levels 


In the non-relativistic approximation, the stationary states of the atom are 
determined by Schrédinger’s equation for the system of electrons, which 
move in the Coulomb field of the nucleus and interact electrically with one 
another; the spin operators of the electrons do not appear in this equation. 
As we know, for a system of particles in a centrally symmetric external field 
the total orbital angular momentum L and the parity of the state are conserved. 
Hence each stationary state of the atom will be characterized by a definite 
value of the orbital angular momentum L and by its parity. Moreover, the 
coordinate wave functions of the stationary states of a system of identical 
particles have a certain permutational symmetry. We have seen in §63 that, 
for a system of electrons, a definite value of the total spin of the system cor- 
responds to each type of permutational symmetry (i.e. to each Young dia- 
gram). Hence every stationary state of the atom is characterized also by the 
total spin S of the electrons. 

The energy level having given values of S and L is degenerate to a degree 
equal to the number of different possible directions in space of the vectors S 
and L. The degree of the degeneracy from the directions of L and S is re- 
spectively 2L +1 and 2S+1. Consequently, the total degree of the degener- 
acy of a level with given L and S is equal to the product (2L 4+1)(2S +1). 

In fact, however, the electromagnetic interaction of the electrons contains 
relativistic effects, which depend on their spins. These effects have the 
result that the energy of the atom depends not only on the absolute magni- 
tudes of the vectors L and S but also on their relative positions. Strictly 
speaking, when the relativistic interactions are taken into account the orbital 
angular momentum L and the spin S of the atom are not separately conserved. 
Only the total angular momentum J = L +S is conserved; this is a universal 
and exact law which follows from the isotropy of space relative to a closed 
system. For this reason, the exact energy levels must be characterized by 
the values / of the total angular momentum. 

However, if the relativistic effects are comparatively small (as often 
happens), they can be allowed for as a perturbation. Under the action of 
this perturbation, a degenerate level with given L and S is “split” into a 
number of distinct (though close) levels, which differ in the value of the 
total angular momentum J. These levels are determined (in the first approxi- 
mation) by the appropriate secular equation (§39), while their wave functions 
(in the zeroth approximation) are definite linear combinations of the wave 
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functions of the initial degenerate level with the given L and S. In this 
approximation we can therefore, as before, regard the absolute values of the 
orbital angular momentum and spin (but not their directions) as being 
conserved, and characterize the levels by the values of L and S also. 

Thus, as a result of the relativistic effects, a level with given values of L 
and S is split into a number of levels with different values of J. This splitting 
is called the fine structure (or the multiplet splitting) of the level. As we know, 
J takes values from L+S to |L— S|; hence a level with given L and S is split 
into 2S+1 (if L > S) or 2L +1 (if L < S) distinct levels. Each of these is 
still degenerate with respect to the directions of the vector J; the degree of 
this degeneracy is 2/+1. It is easily verified that the sum of the numbers 
2J +1 for all possible values of J is equal to (2£+1)(2S +1), as it should be. 

There is a generally accepted notation to denote the atomic energy levels 
(or, as they are called, the spectral terms of the atoms), similar to that used 
for the states of individual particles with definite values of the angular 
momentum (§32): states with ditferent values of the total orbita] angular 
momentum L are denoted by capital Latin letters, as follows: 


B=O. |) 2s + & & 7 Bo 10 
S P DFGOGHIDKEMN 


Above and to the left of this letter is placed the number 2S +1, called the 
multiplicity of the term (though it must be borne in mind that this num- 
ber is the number of fine-structure components of the level only when 
L 2 S).+ Below and to the right of the letter is placed the value of the total 
angular momentum J. Thus the symbols *P,/., ?P;,. denote levels with 
L=1,S= 4, J = fand 3. 


§67. Electron states in the atom 


An atom with more than one electron is a complex system of mutually 
interacting electrons moving in the field of the nucleus. For such a system 
we can, strictly speaking, consider only states of the system as a whole. 
Nevertheless, it is found that we can, with fair accuracy, introduce the idea 
of the states of each individual electron in the atom, as being the stationary 
states of the motion of each electron in some effective centrally symmetric 
field due to the nucleus and to all the other electrons. These fields are in 
general different for different electrons in the atom, and they must all be 
defined simultaneously, since each of them depends on the states of all the 
other electrons. Such a field is said to be self-consistent. 

Since the self-consistent field is centrally symmetric, each state of the elec- 
tron is characterized by a definite value of its orbital angular momentum l. 
The states of an individual electron with a given J are numbered (in order 
of increasing energy) by the principal quantum number n, which takes the 
values n = 1+1, 1+2, ... ; this choice of the order of numbering 1s made in 


+ The levels with 2S +1 = 1, 2,3, ... are called singlet, doublet, triplet, etc., levels. 
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accordance with what is usual for the hydrogen atom. However, the sequence 
of levels of increasing energy for various / in complex atoms is in general 
different from that found in the hydrogen atom. In the latter, the energy 
is independent of /, so that the states with larger values of n always have 
higher energies. In complex atoms, on the other hand, the level with n = 5, 
l = 0, for example, is found to lie below that with n = 4, l = 2 (this is 
discussed in more detail in §73). 

The states of individual electrons with different values of n and I are 
customarily denoted by a figure which gives the value of the principal 
quantum number, followed by a letter which gives the value of /:} thus 4d 
denotes the state with n = 4, l = 2. A complete description of the atom 
demands that, besides the values of the total L, S and J, the states of all the 
electrons should also be enumerated. Thus the symbol 1s 2p 3Po denotes a 
state of the helium atom in which L = 1, S = 1, J] = 0 and the two electrons 
are in the Ís and 2p states. If several electrons are in states with the same Z 
and n, this is usually shown for brevity by means of an index: thus 3p? 
denotes two electrons in the 3p state. The distribution of the electrons in the 
atom among states with different / and n is called the electron configuration. 

For given values of n and J, the electron can have different values of the 
projections of the orbital angular momentum (m) and of the spin (c) on the 
z-axis. For a given J, the number m takes 2/+1 values; the number o is 
restricted to only two values, +4. Hence there are altogether 2(2/+1) 
different states with the same n and Z; these states are said to be equivalent. 
According to Pauli’s principle there can be only one electron in each such 
state. Thus at most 2(2/+1) electrons in an atom can simultaneously have 
the same n and J. An assembly of electrons occupying all the states with the 
given n and Zis called a closed shell of the type concerned. 

The difference in energy between atomic levels having different L and S 
but the same electron configuration{ is due to the electrostatic interaction 
of the electrons. These energy differences are usually small, and several 
times less than the distances between the levels of different configurations. 
The following empirical principle (Hund’s rule; F. Hund 1925) is known 
concerning the relative position of levels with the same configuration but 
different L and S: 

The term with the greatest possible value of S (for the given electron con- 
figuration) and the greatest possible value of L (for this S) has the lowest energy. || 

We shall show how the possible atomic terms can be found for a given elec- 
tron configuration. If the electrons are not equivalent, the possible values 





t Another terminology often used ts that in which electrons with principal quantum 
numbers n = 1, 2, 3, ... are said to belong to the K, L, M, ... shells (see §74). 
t We here ignore the fine structure of each multiplet level. 

i The requirement that S should be as large as possible can be explained as follows. Let 
us consider, for example, a system of two electrons. Here we can have S = Oor S = 1; the 
spin 1 corresponds to an antisymmetrical coordinate wave function ¢(r,, rz). For r; = ra, 
this function vanishes; in other words, in the state with S = 1 the probability of finding the 
two electrons close together is small. This means that their electrostatic repulsion is com- 
paratively small, and hence the energy is less. Similarly, for a system of several electrons. the 
“most antisymmetrical”’ coordinate wave function corresponds to the greatest spin. 
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of L and S are determined immediately from the rule for the addition of 
angular momenta. Thus, for instance, with the configurations np, n'p (n, n' 
being different) the total angular momentum L can take the values 2, 1, 0, 
and the total spin S = 0,1; combining these, we obtain the terms 13S, 
13P 13]), 

If we are concerned with equivalent electrons, however, restrictions im- 
posed by Pauli’s principle make their appearance. Let us consider, for 
example, a configuration of three equivalent p electrons. For /= 1 (the p 
state), the projection m of the orbital angular momentum can take the values 
m = 1, 0, —1, so that there are six possible states, with the following values 
of mand oc: 


(a) 1,3 (b) 0,4 (c) —I1,4 
(a) 1, —} (6°)0,—% (c)—1, 4. 


The three electrons can be one in each of any three of these states. As a 
result we obtain states of the atom with the following values of the projections 
M, = im, Ms = Èo of the total orbital angular momentum and spin: 


(ata’+b)2,4 (ata’+c) 1,4 (a+b+c) 0, 3 
(a+b+b') 1,4 (a+b+c') 9,4 

(a+b'+c) 0,4 

(a’+b+c) 0, 4. 


The states with M, or Mg negative need not be written out, since they give 
nothing different. The presence of a state with Mz = 2, M şs = } shows that 
there must be a 2D term, and to this term there must correspond one state 
(1, 4) and one (0, 4). Next, there remains one state with (1, 4), so that there 
must be a 2P term; one of the states (0, 4) corresponds to this. Finally, there 
remain the states (0, 3) and (0, 3), corresponding to a 4S term. Thus, for a 
configuration of three equivalent p electrons, the only possibilities are one 
term of each of the types 7D, *P, 4S. 

Table 1 gives the possible terms for various configurations of equivalent 
p and d electrons. The figures below the letters of the terms show the num- 
ber of terms of the type concerned that exist for the given configuration, if 
this number is more than one. For the configuration with the greatest 
possible number of equivalent electrons (s?, pê, d!°, ...), the term is always 1S. 
Like terms always correspond to configurations which differ in that one of 
them has as many electrons as the other lacks to form a closed shell. This is 
an evident result of the fact that the absence of an electron from the shell can 
be regarded as a “hole”, whose state is defined by the same quantum numbers 
as the state of the missing electron. 

When Hund’s rule is applied to determine the ground term of an atom 
from a known electron configuration, only the unfilled shell need be con- 
sidered, since the moments of electrons in closed shells cancel out. For 
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Possible terms for configurations of equivalent electrons 
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p*, p? 1SD 3p 
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d, d? | 2D 
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example, let there be four d electrons outside the closed shells in an atom. 
The magnetic quantum number of the d electron can take five values: 
0, +1, +2. Hence all four electrons can have the same spin component 
c = 4, and the maximum possible total spin is S = 2. We must then assign 
to the electrons different values of m so as to give the maximum value of 
M, = Em, namely 2, 1, 0, —1, My, = 2. This means that the maximum 
value of L for S = 2 is also 2, and the term is 5D. 


PROBLEM 


Find the orbital wave functions of the possible states of a system of three equivalent p 
electrons. 


SoLuTION. In the states 4S the spin projections g of all the electrons are the same, and the 
values of m are therefore different. The wave function is given by a determinant of the form 
(61.5) composed of the functions wp, Yı, Yı (where the suffix shows the value of m). 

For the ?D term we consider the state with the maximum possible value Mr = 2. Two 
of the components m will be 1 and the other 0. Let electrons 2 and 3 have ¢ = +4 and 
electron 1 have o = —} (corresponding to total spin S = 4). The orbital wave function 
having the required symmetry is 


l = 
p= yy Poa) — do(3)¥1(2)}, 


the argument of each function being the number of the electron to which 1t refers. 

For the 2P term we consider the state with Mg = 1 and the same values of the electron 
spin components as previously. This state can be obtained with two different sets of values 
of m, so that the orbital wave function is given by the linear combination 


4 = af- + b100, 
-ini = pa p-(2(3)— 4-134412), 
Poo = po( Da (23) — y1(3)Yoo(2)}.- 
To determine the coefficients, we use the relation 
Lip = (b0 hD iy = 0, 


which must be satisfied by the wave function with M = L 27.8 Usi z a 
elements (27.12), we find that $ (see ( )). Using the matrix 


gy = 0, Lyi = Vo, Lpo = V24, 
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and so 

Lid = y2(a-b)bon = 0. 
Hence a—b = 0, and using also the normalization condition, we have a = b = 3 


The wave functions of states with Jf, < L are obtained trom those found above by applv- 
ing to them the operator L_. 


$68. Hydrogen-like energy levels 


The only atom for which Schrédinger’s equation can be exactly solved 
is the simplest of all atoms, that of hydrogen. The energy levels of the hydro- 
gen atom, and of the ions Het, Litt. ... which each have only one electron, 
are given by Bohr’s formula (36.10) 

mE: l 
pornn hni (68.1) 
2h2(14-m/M) w 
Here Ze is the charge on the nucleus, M its mass, and m the mass of the elec- 
tron. We notice that the dependence on the mass of the nucleus is only very 
slight. 

The formula (68.1) does not take account of any relativistic effects. In 
this approximation there is an additional (accidental) degeneracy, peculiar 
to the hydrogen atom, of which we have already spoken in §36; for a given 
principal quantum number 2, the energy is independent of the orbital angular 
momentum I. 

Other atoms have states whose properties recall those of hydrogen. We 
refer to highly excited states, in which one of the electrons has a large principal 
quantum number, 2nd so is mostly at large distances from the nucleus. The 
motion of such an electron can be regarded, to a certain approximation, as 
motion in the Coulomb field of the rest of the atom, whose effective charge Is 
unity. The values of the energy levels thus obtained are, however, too in- 
exact; it is necessary to apply to them a correction to take account of the devia- 
tion of the field from the pure Coulomb field at small distances. The nature 
of this correction is easily ascertained from the following considerations. 

Since the states with large quantum numbers are quasi-classical, the energy 
levels can be determined from Bohr and Sommerfeld’s quantization rule 
(48.6). The deviation from the Coulomb field at distances from the nucleus 
small compared with the “orbit radius” can be formally allowed for by an 
alteration in the boundary condition imposed on the wave function at 
r= 0. This brings about a change in the constant y in the quantization 
condition for radial motion. Since this condition is otherwise unchanged, 
we can conclude that we obtain for the energy levels an expression which 
differs from that for hydrogen in that the radial, that is, the principal, quantum 
number n is replaced by n + ân where A; is some constant (known as 
Ry dberg's correction): 

met 1 


(68.2) 


É 2R? (n+ A,)? 


§69 The self-consistent field 257 


Rydberg’s correction is (by definition) independent of n, but it is of course 
a function of the azimuthal quantum number / of the excited electron (which 
we add as a suffix to \), and of the angular momenta L and S of the whole 
atom. For given L and S, A; decreases rapidly as Z increases. The greater /, 
the less time the electron spends near the nucleus, and hence the energy levels 
must approach more and more closely those of hydrogen as / increases. 


PROBLEM 


Find the asymptotic form of the wave function for the hydrogen-like s state of an electron 
at large distances from the rest of the atom. 


SOLUTION. At large distances, where the field U = —1/r (in atomic units), the required 
function ¢(7) satisfies Schrédinger’s equation 


PŽP- ipt ty = 0 


where x = +/(2|E|). Seeking the solution in the form ¢% = constant x r’e-*"’ and neglecting 
terms in the equation that decrease more rapidly than Ņ;r, we find 


ys = constant x rilis -le-*r, 


§69. The self-consistent field 


Schrédinger’s equation for atoms containing more than one electron 
cannot be solved in an analytical form. Approximate methods of calculating 
the energies and wave functions of the stationary states of the atoms are 
therefore important. The most important of these methods is what is called 
the self-consistent field method. The idea of this method consists in regarding 
each electron in the atom as being in motion in the “self-consistent field” 
due to the nucleus together with all the other electrons. 

As an example, let us consider the helium atom, restricting ourselves to 
those terms in which both the electrons are in s states (with or without the 
same 7); the states of the whole atom will then be S states also. Let ,(7;) 
and (rs) be the wave functions of the electrons; in the s states they are 
functions only of the distances 7,, r, of the electrons from the nuclei. The 
wave function (r1, r2) of the atom as a whole is a symmetrized 


b = dy(7,)bo(72) +dr(72)P0(71) (69.1) 


or antisymmetrized 


Y = dy(r,)ho(r2)— Alal) (69.2) 


t As an illustration, we may give the experimental values of Rvdberg’s correction for the 
highly excited states of the helium atom. The total spin of this atom can have the values 
S = 0 and 1, while the total orbital angular momentum L is, in the states considered, the 


same as the angular momentum / of the excited electron (the other electron being in the 
state ts) Rvdberg’s corrections are 





for S$ = 0: Ap = —0-140, Ay = +0-012, Ay = — 0-0022; 


for S = }: Ay = —0-296, Ay = — 0-068, Ay = —0-0029. 
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product of the two functions, according as we are concerned with states of 
total spint S = Oor S =1. We shall consider the second of these. The 
functions y and 4, can then be regarded as orthogonal. 

Let us try to determine the function of the form (69.2) which is the best 
approximation to the true wave function of the atom. To do so, it is natural 
to start from the variational principle, allowing only functions of the form 
(69.2) to be considered; this method was proposed by V. A. Fok (1930). 

As we know, Schrédinger’s equation can be obtained from the variational 
principle 


f | w*Ayar,aV, = minimum, 
with the additional condition 
ff iy? avan; = 1 


(the integration is extended over the coordinates of both electrons in the 
helium atom). The variation gives the equation 


i j Su*(H—E\y dV,dV, = 0, (69.3) 


and hence, with an arbitrary variation of the wave function y, we obtain the 
usual Schr6édinger’s equation. In the self-consistent field method, the 
expression (69.2) for ys is substituted in (69.3), and the variation is effected 
with respect to the functions 4 and y separately. In other words, we seek 
an extremum of the integral with respect to functions y% of the form (69.2); 
as a result we obtain, of course, an inexact eigenvalue of the energy and an 
inexact wave function, but the best of the functions that can be represented 
in this form. 
The Hamiltonian for the helium atom 1s of the form|| 


H = A,+H,+1/ro, Ĥ, = —$/A,—2/n, (69.4) 
where r is the distance between the electrons. Substituting (69.2) in (69.3), 


carrying out the variation, and equating to zero the coefficients of dy, and oy, 
in the integrand, we easily obtain the following equations: 


[4A 4+2/r+E—H2,.— Goo(r) (7) + [F+ Gio(7) Yolr) = 0, 
[AA +2/r+ E— Hy, —G,,(r)}ho(7)+[Fiet Gicl7) (7) = 9, 


where 


i (69.5) 


Gar(71) = f Wal72)¥o(r2) dV 4/r1., 


He = f tel—4A—2irn dV (a,b = 1, 2) (69.6) 


+ The states of the helium atom with S = 0 are usually called parahelzum states, and those 
with S = 1 orthohelzum states. 

t The wave functions Y, Ya ... of the various states of the electron which are obtained by 
the self-consistent field method are not in general orthogonal, since they are solutions of 
different equations, not of the same equation. In (69.2), however, without altering the function 
y of the whole atom, we can replace Y% by Ye = ¥2+tconstant x y}; by an appropriate 
choice of the constant, we can always ensure that Y, and pz’ are orthogonal. 

l In this section (including the Problems) we use atomic units. 
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These are the final equations resulting from the self-consistent field method; 
they can, of course, be solved only numerically.t 

The equations are similarly derived in more complex cases. The wave 
function of the atom to be substituted in the integral in the variational 
principle is in the form of a linear combination of products of the wave func- 
tions of the individual electrons. This combination must be so chosen that, 
firstly, its permutational symmetry corresponds to the total spin S of the 
state of the atom considered and, secondly, it corresponds to the given value 
of the total orbital angular momentum L of the atom.] 

By using, in the variational principle, the wave function having the neces- 
sary permutational symmetry, we automatically take account of the exchange 
interaction of the electrons in the atom. Simpler equations (though leading 
to less accurate results) are obtained if we neglect the exchange interaction 
and also the dependence on L of the energy of the atom for a given electron 
configuration (D. R. Hartree 1928). As an example, let us again consider the 
helium atom; we can then write the equations for the wave functions of the 
electrons immediately in the form of ordinary Schrédinger’s equations: 


[t}ActEo—Va(ra) ala) = 0 (2 = 1,2), (69.7) 


where V, is the potential energy of one electron moving in the field of the 
nucleus and in that of the distributed charge of the other electron: 


Vlr) = Hnt | Urdhra) Vos (69.8) 


and similarly for V,. In order to find the energy E of the whole atom, we 
must notice that, in the sum E, + E, the electrostatic interaction between the 
two electrons is counted twice, since it appears in the potential energy V;,(r,) 
of the first electron and in that—V,(r,)—of the second. Hence E is obtained 
from the sum E; +E, by subtracting once the mean energy of this interaction; 
that is. 


E = E+E || (rhdh) VdV (69.9) 


To refine the results obtained by this simplified method, the exchange 
interaction and the dependence of the energy on L can afterwards be taken 
into account as perturbations. 


PROBLEMS 


PROBLEM 1. Determine approximately the energy of the ground level of the helium atom 
and helium-like ions (a nucleus of charge Z and two electrons), regarding the interaction 
between the electrons as a perturbation. 





+ A comparison of the energy levels of light atoms, calculated by the self-consistent field 
method, with spectroscopic data enables us to estimate the accuracy of the method at about 
> per cent, and in some cases even better. For complex atoms, however, the error may be- 
come comparable with the intervals between adjacent levels, and hence give an incorrect 
sequence of levels. 


l t An account of general methods of constructing wave functions for a system of electrons 
in a central field is given in Kaplan’s book quoted in §63. 
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SOLUTION. In the ground state of the ion, both electrons are in s states. The unperturbed 


value of the energy is twice the ground level of a hydrogen-like ion (because of the two 
electrons): 


EO = 2(—4Z2) = -22 


The correction in the first approximation is given by the mean value of the electron inter- 
action energy in a state with wave function 


3 





en Ziritra) (1) 


Y = Ya(ri)fe(r2) = 


7 


ithe product of two hydrogen functions with Z = 0). The integral 


i 
EY = { f pe——iVdh 
712 


is most simply calculated as 


© r, 
j 
ED = 2| AV pfd Vao, d= 4rr\2dr1, 
r2 
0 


0 
dV2 = 4nr2*dro, 


the energy of the charge distribution pe = | ¢2 |? in the field of the spherically symmetric 
distribution p, = |42; the integrand with dl’, is the energy of the charge p.(r2) in the field 
of the sphere r, < ra, and the factor 2 takes account of the contribution from configurations 
in which 7; > ro. Thus we find EM = 52/8, and finally 


E = EEV = — 72487. 


For the helium atom (Z = 2) this gives —E = 11/4 = 2-75; the actual value of the ground- 
state energy of this atom is —E = 2-90 atomic units = 78-9 eV. 


PROBLEM 2. The same as Problem 1, Lut using the variational principle, approximating 
the wave function by a product of two hydrogen functions with some effective nuclear charge. 


SOLUTION. We calculate the integral 


Z 24.1 
f [yarara H = -k(Ai+ A) -—- — 


r) 72 Pye 


with the function 4 given by (1), Problem 1, but with Zeff instead of Z. The integral of {7/712 


is calculated as in Problem 1; the integral of /A,¥ can be reduced to that of ¥?/r;, since, by 
Schrédinger’s equation, 


Zen 
(Jars Faa = — Zenn. 
1 
The result is 


[JuAvaria Vo =Zen®—2ZZent Zen- 


This expression as a function of Zert has a minimum at Zett = Z— fr. The corresponding 
value of the energy is 


E= —(Z-%)} 


For the helium atom this gives —E = 2-85. a 

It may be noted that the wave function (1) with the above value of Zerr is in fact the best 
not only of all functions of the form (1) but of all functions which depend only on the sum 
ry + To. ` 
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§70. The Thomas-Fermi equation 


Numerical calculations of the charge distribution and field in the atom by 
the self-consistent field method are extremely cumbersome, especially for 
complex atoms. For these, however, there is another approximate method, 
whose value lies in its simplicity; its results are admittedly much less accurate 
than those of the self-consistent field method. 

The basis of this method (E. Fermi, and L. Thomas, 1927) is the fact that, 
in complex atoms with a large number of electrons, the majority of the elec- 
trons have comparatively large principal quantum numbers. In these condi- 
tions the quasi-classical approximation is applicable. Hence we can apply 
the concept of ‘‘cells in phase space” (§48) to the states of the individual 
electrons. 

The volume of phase space corresponding to electrons which have momenta 
less than p and are in the volume element dV of physical space is rp? dV. 
The number of cells, i.e. possible states, corresponding to this volume isT 
4p? dV/3(27)3, and in these states there cannot at any one time be more than 

3 3 
trp r s e? dV 
3(27)8 3r? 





electrons (two electrons, with opposite spins, in each cell). In the normal 
state of the atom, the electrons in each volume element dV must occupy (in 
phase space) the cells corresponding to momenta from zero up to some maxi- 
mum value py. Then the kinetic energy of the electrons will have its smallest 
possible value at every point. If we write the number of electrons in the 
volume dV as ndV (where n is the number density of electrons), we can say 
that the maximum value fo of the momenta of the electrons at every point 
is related to n by 


Peon = n. 


The greatest value of the kinetic energy of an electron at a point where the 
electron density is » is therefore 


tp = 4(3n'n)Ph, (70.1) 


Next, let ¢(r) be the electrostatic potential, which we suppose zero at 
infinity. The total energy of the electron is 4p?—¢. It is evident that the 
total energy of each electron must be negative, since otherwise the electron 
moves off to infinity. We denote the maximum value of the total energy of 
the electron at each point by —¢,, where dy, is a positive constant; if this quan- 
tity were not constant, the electrons would move from points with smaller 
$o to those with greater d,. Thus we can write 


tio” = $— ġo. (70.2) 


+ In this section we use atomic units. 
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Equating the expressions (70.1) and (70.2), we obtain 


n = [2¢—4,) 2/302, (70.3) 


a relation between the electron density and the potential at every point in the 
atom. 

For ¢ = ġo the density n vanishes; n must clearly be put equal to zero also 
in the whole of the region where ġ < ġo, and where the relation (70.2) would 
give a negative maximum kinetic energy. Thus the equation ¢ = ¢, deter- 
mines the boundary of the atom. ‘There is, however, no field outside a 
centrally symmetric system of charges whose total charge is zero. Hence we 
must haved = Oat the boundary of a neutral atom. It follows from this that, 
for a neutral atom, the constant ¢, must be put equal to zero. On the other 
hand, ¢, is not zero for an ion. 

Below we shall consider a neutral atom, putting accordingly ġo = 0. 
According to Poisson’s electrostatic equation, we have ^g = 477; substitut- 
ing (70.3) in this, we obtain the fundamental equation of the Thomas—Fermi 
method: 


Ad = (84/2/30) 43/2. (70.4) 


The field distribution in the normal state of the atom is determined by the 
centrally symmetric solution of this equation that satisfies the following 
boundary conditions: for r + 0 the field must become the Coulomb field of 
the nucleus, i.e.¢dr > Z, while for r > œ we must haved¢dr > 0. Introducing 
here, in place of the variable r, a new variable x according to the definitions 


r= xbZ-8, b = }(łr}?P = 0-885, (70.5) 


and, in place of ¢, a new unknown function y by t 











Z frzn Z4 y(x) 
d(r) = =x( ) = ; (70.6) 
r b O° eX 
we obtain the equation 
x12 d?yjdx? = 8/2, (70.7) 


with the boundary conditions y = 1 for x = 0 and y = 0 for x = œ. This 
equation contains no parameters, and thus defines a universal function x(x). 
Table 2 gives values of this function obtained by numerical integration of 
equation (70.7). The function y(x) decreases monotonically, and vanishes 
only at infinity.t In other words, the atom has no boundaries in the Thomas- 
Fermi model, and formally extends to infinity. 


+ In ordinary units, 
olr) = (Ze/r)y(r Z'/8me?/0-885h”). 


t The equaticn (70.7) has the exact solution x(x) = 144x-8, which vanishes at infinity 
but does not satisfy the boundary condition at x = 0. It could be used as an asymptotic 
expression for the function x(x) for large x. However, this expression gives fairly exact values 
only for very large x, whilst the Thomas-Ferm equation becomes inapplicable at large 
distances (see below). 
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TABLE 2 


Values of the function x(x) 















1-4 6 0-0594 

1-6 7 0-0461 

1-8 8 0:0366 

2 9 0-0296 

222 10 0-0243 

; 2-4 11 0-0202 
0-2 0-793 2-6 12 0-0171 
03 0-721 2-8 13 0-0145 
0-4 0-660 3-0 14 0-0125 
0-5 0-607 oe 15 0-0108 
0-6 0-561 3-4 20 0:0058 
0:7 0-521 3-6 25 0-0035 
0-8 0-485 3-8 30 0-0023 
ga 0:453 +-0 +0 0-0011 
1:0 0-424 +5 50 0-00063 
jt 0:374 5:0 60 0-00039 

The value of the derivative y’(x) for x = 0 is x'(0)} = —1-59. Hence, as 


x +0, the function y(x) is of the form y ~ 1-—1-59x, and accordingly the 
potential g(r) is 


$(r) = Zr —1-80Z4/3, (70.8) 


The first term is the potential of the field of the nucleus, while the second 
is the potential at the origin due to the electrons. 

Substituting (70.6) in (70.3), we find for the electron density an expression 
of the form 


n = Z3f(rZ¥3/b), f(x) = (32/913)(x/x). (70.9) 


We see that, in the Thomas—Fermi model, the charge density distribution in 
different atoms is similar, with Z-1"3 as the characteristic length (in ordinary 
units H?/me?Z1/3, i.e. the Bohr radius divided by 213). If we measure 
distances in atomic units, the distances at which the electron density has its 
maximum value are the same for all Z. Hence we can say that the majority 
of the electrons in an atom of atomic number Z are at distances from the 
nucleus of the order of Z-!/3. A numerical calculation shows that half the 
total electron charge in an atom lies inside a sphere of radius 1-33Z-¥. 
Similar considerations show that the mean velocity of the electrons in the 


atom (taken, as an order of magnitude, as the square root of the energy) is of 
Me order olsi >. 


The Thomas-Fermi equation becomes inapplicable both at very small and 
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at very large distances from the nucleus. Its range of applicability for small r 
is restricted by the inequality (49.12); at smaller distances the quasi-classical 
approximation becomes invalid in the Coulomb field of the nucleus. Putting 
in (49.12) « = Z, we find 1/Z as the lower limit of distance. ‘The quasi- 
classical approximation becomes invalid for large r also in a complex atom. 
In fact, it is easy to see that, for r ~ 1, the de Broglie wavelength of the elec- 
tron becomes of the same order of magnitude as the distance itself, so that the 
quasi-classical condition is undoubtedly violated. This can be seen by esti- 
mating the values of the terms in equations (70.2) and (70.4); indeed, the 
result is obvious without calculation, since equation (70.4) does not involve 
Z. Thus the applicability of the Thomas-Ferm1 equation is limited to dis- 
tances large compared with 1/Z and small compared with unity. In complex 
atoms, however, the majority of the electrons in fact lie in this region. 

This means that the “outer boundary” of the atom in the Thomas—Ferm1 
model is at r~1, i.e. the dimensions of the atom do not depend on Z. The 
energy of the outer electrons, i.e. the ionization potential of the atom, ts 
likewise independent of Z.t 

By means of the Thomas—Fermi method we can calculate the total ioniza- 
tion energy E, i.e. the energy needed to remove all the electrons from the 
neutral atom. To do this, we must calculate the electrostatic energy of the 
Thomas—Fermi distribution for the charges in the atom; the required total 
energy is half this electrostatic energy, since the mean kinetic energy in a 
system of particles interacting in accordance with Coulomb’s law is (by the virial 
theorem; see Mechanics, §10) minus half the mean potential energy. The 
dependence of E on Z can be determined a priori from simple considerations: 
the electrostatic ~:ergy of Z electrons at a mean distance Z~1/3 from a 
nucleus of charge Z, and moving in tts field, is proportional to Z- Ziz i= 
Z733. A numerical calculation gives the result E = 20-8273 eV. The 
dependence on Z is in good agreement with the experimental data, though 
the empirical value of the coefficient is close to 16. 

We have already mentioned that positive (non-zero) values of the constant 
do correspond to ionized atoms. If we define the function y by¢—¢o = Zy/r, 
we obtain the same equation (70.7) for y as previously. We must now, how- 
ever, consider only solutions which vanish not at infinity as for the neutral 
atom, but for finite values x, of x. Such solutions exist for any xg. At the 
point x = x,, the charge density vanishes together with y, but the potential 
remains finite. The value of xo is related to the degree of ionization in the 
following manner. The total charge inside a sphere of radius v is, by Gauss’s 
theorem, —1206/8r = Z[y(x)~—xx'(x)]. The total charge z on the ion is 
obtained by putting x = x, in this; since x(x )) = 0, we have 


z = —Zxox'(%)- (70.10) 


+ This model does not, of course, show the periodic dependence of the dimensions and 
ionization potential of the atom on Z, which appears in the periodic system of the elements. 
Moreover, experimental data indicate the existence of a shght but steady increase in dimen- 
sions and decrease in the ionization potential as Z increases. 
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The thick line in Fig. 23 shows the curve of x(x) for a neutral atom; below 
it are two curves for ions of different degrees of ionization. The quantity 
z/Z is shown graphically by the length of the segment intercepted on the 
axis of ordinates by the tangent to the curve at x = Xp. 








Equation (70.7) also has solutions which are nowhere zero; these diverge 
at infinity. They can be regarded as corresponding to negative values of the 
constant ġo. Figure 23 also shows two such curves of x(x); they lie above the 
curve for the neutral atom. At the point x = x,, where 


x(%1)—*1x"(%1) = 0, (70.11) 


the total charge inside the sphere x < x, is zero (graphically, this point is 
evidently the one where the tangent to the curve passes through the origin). 
If we cut off the curve at this point, we can say that it defines x(x) for a neutral 
atom at whose boundary the charge density remains non-zero. Physically, 
this corresponds to a “compressed? atom confined to some given finite 
volume.t 

The Thomas—Fermi equation does not take account of the exchange inter- 
action between electrons. The effects which this involves are of the next 
order of magnitude with respect to Z-?/3. Hence an allowance for the ex- 
change interaction in the Thomas—Fermi method requires a simultaneous 
consideration of both these effects and others of the same order of magnitude. f 


PROBLEM 


Find the relation between the energy of the electrostatic interaction between electrons 


and that of their interaction with the nucleus in a neutral atom, using the Thomas—Fermi 
model. 


SOLUTION. The potential ġe of the field due to the electrons is found by subtracting the 
potential Z.r of the nucleus from the total potential ¢. The energy of the interaction between 





t This approach may be useful in studying the equation of state of highly compressed 
matter. 


t This has been done by A. S. Kompaneets and E. S. Pavlovskii (Soviet Physics JETP 4 
328, 1957) and by D. Kirzhnits (ibid. 5, 64, 1338, 1957). ( rysics J 
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the electrons is therefore 


Use = —4]dendV 


ay [~av-3 [ grav 





3n2)2.3 
he [assar 
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(where ¢ has been expressed in terms of n by means of (70.3)). The energy Uen of the 
interaction between the electrons and the nucleus and their kinetic energy T are therefore 


7 
Uen = -zfZa F, 
7 


Pe 
T= 2f fap? 4np*dpdV 
(27h)? 


(372)2/8 
= 3 [ assay, 





Comparing these expressions with the previous equation, we find 
Cee = —48Uen—$T. 


According to the virial theorem (see Mechanics, §10), for a system of particles interacting 
according to Coulomb’s law we have 2T = —U = — Uen— Uee. Thus finally 


Uee =a =} Uen. 


§71. Wave functions of the outer electrons near the nucleus 


We have seen, on the basis of the Thomas—fermi model, that the outer 
electrons in complex atoms (Z large) are mainly at distances r ~ 1 from the 
nucleus.t A number of properties of atoms, however, depend significantly 
on the electron density near the nucleus; such properties will be considered 
in §§72 and 120. To determine the order of magnitude of this density we 
may examine the variation of the wave function (7) of the electron in the 
atom when r varies from large (r ~ 1) to small distances. 


In the region r ~ 1, the field of the nucleus is screened by the remaining 
electrons, so that the potential energy U(r) ~ 1/r ~ 1. The energy of the 
electron level in this field E ~ 1. At distances of the order of the Bohr 
radius in the field of a charge Z, r ~ 1/Z, the field of the nucleus may be 
regarded as unscreened, and U = — Z/r. In the transitiona! region, 1/Z < r 
< 1, the potential energy | U | is large compared with the electron energy E, 


and the condition 
d/l d ] 
See 
dy p dr ai | U | 


t In this section we use atomic units. 
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holds (where p is the momentum), so that the motion of the electron is quasi- 
classical. The spherically symmetrical quasi-classical wave function is 


1 1 
Wl a E a 71.1) 
the order of magnitude of the coefficient (~ 1) being determined by the con- 
dition ù ~ 1 for “joining” to the wave function forr ~ 1. 
Applying the expression (71.1) in order of magnitude for r ~ 1/Z (sub- 
stituting U = —Z/r), we obtain the required value of the wave function 
near the nucleus:} 


JZ) ~4/Z. (71.2) 


In accordance with the general properties of wave functions in a central 

field (§32), when the distance decreases further ẹ(r) either remains constant 

an order of magnitude (for an s electron) or begins to decrease (for / # 0). 
The probability of finding the electron in the region r < 1/Z is 


w~ | yp |278~ 1/22. (71.3) 


The formulae (71.2) and (71.3) of course determine only the systematic 
variation with increasing Z, and do not take into account non-systematic 
variations from one element to the next. 


§72. Fine stricture of atomic levels 


A consistent derivation of the formulae for relativistic effects in the inter- 
action of electrons belongs to the next volume (see ROT, §§33 and 83). 
Here we shall give only a general account of these effects as they relate to 
atomic terms. It is found that the relativistic terms in the Hamiltonian of 
an atom fall into two classes. One of these contains terms linear with respect 
to the spin operators of the electrons, while the other includes quadratic 
terms. The former correspond to the interaction between the orbital 
motion of the electrons and their spin (this interaction 1s called spin—orbit 
interaction), while the latter correspond to the interaction between the 
spins of the electrons (spin-spin interaction). Both interactions are of the 
same order (the second) with respect to v/c, the ratio of the velocity of the 
electrons to that of light; in practice, the spin-orbit interaction considerably 
exceeds the spin-spin interaction in heavy atoms. This is because the spin- 
orbit interaction increases rapidly with the atomic number, whereas the 
spin-spin interaction is essentially independent of Z (see below). 

The spin-orbit interaction operator is of the form 


Vea = z ALS, (72.1) 
(the summation being over all the electrons in the atom), where 8, are the 


t To determine the coefficient in this formula (when the wave function is known in the 
region r ~ 1), we should have to use the expression (36.25) in the range r & 1/Z. 
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spin operators of the electrons, and A, are some “orbital” operators, i.e. opera- 
tors acting on functions of the coordinates. In the self-consistent field 
approximation the operators Aj are proportional to the operators Ía of the 
orbital angular momentum of the electrons, and V5; can then be written in 
the form 


Voy = Catala . Sa. (72.2) 
The coefficients in the sum are given in terms of the potential energy U(r) 
of the electron in the self-consistent field by 
h2? dU(re) 
2mcrg dra l (72.3) 





Xa = 


Since | U(r)| decreases away from the nucleus, all the aq > 0. 

Regarding the interaction (72.2) as a perturbation, we should, in order to 
calculate the energy, average it with respect to the unperturbed state. The 
main contribution to the energy 1s given by distances close to the nucleus, 
of the order of the Bohr radius (~A?/Zme?) for a nucleus with charge Ze. 
In this region the field of the nucleus 1s almost unscreened and the potential 
energy is 

| U(r) |~ Ze? /r ~ Z2met h2, 
so that 
a~ h UJm?c?r? 


~ LZ 4(e2/hic)2mes /h*. 


The mean value of « is obtained by multiplying by tt.c probability w of 
finding the electron near the nucleus. According to (71.3), w~ Z~*, so that 
we have finally that the energy of the spin—orbit interaction of the electron is 


given by 
s Ze ) me’ 
a~ : 
( hic he 








i.e. differs from the fundamental energy of the outer electrons in the atom 
(~ me4/h®) only by the factor (Ze?/Ac)*. This factor increases rapidly with 
the atomic number, and reaches values of the order of unity in heavy atoms. 

The actual averaging of the operator (72.2) over the unperturbed states 
of the electron envelope is done in two steps. First of all, we average over 
electron states of the atom with given values L and S of the total orbital 
angular momentum and spin, but not with given directions of these. After 
this averaging P is still an operator, which, however, we must now express 
only in terms of operators of quantities characterizing the atom as a whole, 
not its individual electrons. These are the operators L and S. We denote by 
Vs, the operator of the spin-orbit interaction thus averaged. Being linear 
in §, it has the form 


AANE T (72.4) 
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where A is a constant characterizing a given (unsplit) term, i.e. depending 
on Sand L but not on the total angular momentum J of the atom.t 

To calculate the energy of the splitting of a degenerate level, we must 
now solve the secular equation formed from the matrix elements of the 
operator (72.4). In this case, however, we already know the correct functions 
in the zeroth approximation, in which the matrix of Vsz is diagonal. These 
are the wave functions of states with definite values of the total angular 
momentum J. _The averaging with respect to such a state involves replacing 
the operator S.L by its eigenvalues, which, according to (31.3), are 


L.S =H J(J+1)—L(L4+1)—S(S+1)]. 


Since the values of L and S are the same for all the components of a multiplet, 
and we are interested only in their relative position, we can write the energy 
of the multiplet splitting in the form 


tA] +1). (72.5) 


The intervals between adjacent components (with numbers J and J—1) are 
consequently 
es Sai. (72.6) 


This formula gives what is called Lande’s interval rule (1923). 

The constant A can be either positive or negative. For A > 0 the lowest 
component of the multiplet level is the one with the smallest possible J, i.e. 
J = |L—S}|; such multiplets are said to be normal. If A < 0, on the other 
hand, the lowest level of the multiplet is that with J = L+S; these multi- 
plets are said to be inverted. 

It is easy to determine the sign of A for the normal states of atoms if the 
electron configuration is such that there is only one shell not completely 
filled. If this shell is not more than half filled, then according to Hund’s 
rule (§67) all z electrons in it have parallel spins, so that the total spin has 
the greatest possible value, S = in. Substituting in (72.2) sz = S/n and 
taking aq (which is the:same for all electrons in a given shell) outside the 
sum we obtain 


Vsz = (2/25) .L, 


i.e. A = o/2S>0. If the shell is more than half full, we first add and sub- 





t In order to clarify the meaning of this operation, it may be noted that averaging in 
quantum mechanics has the general significance of taking the appropriate diagonal matrix 
element. A partial averaging consists in taking a set of matrix elements that are diagonal with 
respect to only some of the quantum numbers describing the state of the system. For example, 
in this case the averaging of the operator (72.2) denotes the construction of a matrix with 
elements nM’ iM’ s|VsilnMtMs> with all possible H, M'L and Ms, M's and diagonal with 
respect to al] the other, quantum numbers (the assembly of which we denote by n). Correspond- 
ingly, the operators S and L are to be regarded as matrices <M‘s|S|Ms> and <LM'uLIMey, 


whose elements are given by (27.13). A similar device of stepwise averaging will be needed 
ın several subsequent treatments. 
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tract in (72.2) the same sum taken over the unoccupied places or holes in 
the incomplete shell. Since, for a completely filled shell, we should have 
V.. = 0, the operator Psi is thereby represented as a sum 


Vs = ey Dy Xa 1,80, 


taken only over the holes, the total spin and orbital angular momentum 
of the atom being S = — È sa, L = — È la. By the same method as pre- 
viously we therefore find A = —«/2S, i.e. A < 0. 

From the above we have a simple rule which gives the value of J in the 
normal state of an atom with one incompletely filled shell. If this shell 
contains not more than half the greatest possible number of electrons for 
that shell, then J = | L— S |; if the shell is more than half full, J = L + S. 

As already mentioned, the spin-spin interaction, unlike the spin-orbit 
interaction, is essentially independent of Z. This is evident from the fact 
that it is a direct interaction between electrons and does not involve the field 
of the nucleus. 

For the averaged spin-spin interaction operator we should obtain, analog- 
ously to formula (72.4), an expression quadratic in S. The expressions S? 
and (§.L)? are quadratic in $. The former has eigenvalues independent 
of J, and therefore does not give any splitting of the term. Hence it can be 
omitted, and we can write 


Vss = BS.L), (27) 


where B is a constant. The eigenvalues of this operator conta:n terms inde- 
pendent of J, terms proportional to J(J+1), and finally a term proportional 
to J J+1)%. The first of these do not give any splitting and hence are without 
interest; the second can be included in the expression (72.5), which simply 
means a change in the constant A. Finally, the last term gives an energy 


tBJ*(J +-1)°. (72.8) 


The scheme for the construction of the atomic.levels discussed in §§66-67 
is based on the supposition that the orbital angular momenta of the electrons 
combine to give the total orbital angular momentum L of the atom, and their 
spins to give the total spin S. As has already been mentioned, this supposi- 
tion is legitimate only when the relativistic effects are small; more exactly, 
the intervals in the fine structure must be small compared with the differences 
between levels with different L and S. This approximation is called the 
Russell-Saunders case (H. N. Russell and F. A. Saunders 1925), and we 
speak also of LS coupling. : 

In practice, however, this approximation has a limited range of applica- 
bility. The levels of the light atoms are arranged in accordance with the LS 
model, but as the atomic number increases the relativistic interactions in the 
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atom become stronger, and the Russell~Saunders approximation becomes 
inapplicable.t¢ It must also be noticed that this approximation is, in parti- 
cular, inapplicable to highly excited levels, in which the atom contains an 
electron which is in a state with large n, and which is therefore mainly at large 
distances from the nucleus (see §68). The electrostatic interaction of this 
electron with the motion of the other electrons is comparatively weak, but 
the relativistic interaction in the rest of the atom is not diminished. 

In the opposite limiting case the relativistic interaction is large compared 
with the electrostatic (or, more precisely, compared with that part of it which 
governs the dependence of the energy on L and S}. In this case we cannot 
speak of the orbital angular momentum and spin separately, since they are 
not conserved. The individual electrons are characterized by their total 
angular momenta j, which combine to give the total angular momentum J of 
the atom. This scheme of arrangement of the atomic levels is called jj coupl- 
img. In practice, this coupling is not found in the pure state, but various types 
of coupling intermediate between LS and jj are observed among the levels of 
very heavy atoms. ft 

A peculiar type of coupling is observed in certain highly excited states. 
Here the rest of the atom may be in a Russell-Saunders state, i.e. may be 
characterized by the values of L and S, while its coupling with the highly 
excited electron is of the jj type; this is again due to the weakness of the elec- 
trostatic interaction for this electron. 

The fine structure of the energv levels of the hydrogen atom has certain 
characteristic properties. It will be calculated exactly in ROT (§34), but here 
we shall only mention that, for a given principal quantum number, the energy 
depends only on the total angular momentum j of the electron. Thus the 
degeneracy of the levels is not completely removed; to a level with given 
n and j there correspond two states with orbital angular momenta / = j + 4 
(unless 7 has the value n—4, which is the greatest possible for a given n). 
Thus the level with n = 3 is split into three levels, of which the states 
51:2, Pi'2 Correspond to one, p3,2 and d3/2 to another, and ds; to the third. 


§73. The Mendeleev periodic system 


The elucidation of the nature of the periodic variation of properties, ob- 
served in the series of elements when they are placed in order of increasing 
atomic number (D. I. Mendeleev 1869), requires an examination of the 
peculiarities in the successive completion of the electron shells of atoms. 
The theory of the periodic system is due to N. Bohr (1922). 

When we pass from one atom to the next, the charge is increased by unity 





t Nevertheless, although the quantitative formulae which describe this type of coupling 
become inapplicable, the method of classifying levels according to this scheme may itself 


remain meaningful for heavier atoms, especially for the lowest states (including the normal 
state), 


| For further details regarding types of cou 
see, for instance. E. U. Condon and G. H. Sh 
University Press. 1935. 


pling and the quantitative aspect of the problem, 
ortley, The Theory of Atomic Spectra, Cambridge 
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and one electron is added to the envelope. At first sight we might expect 
the binding energy of each of the successively added electrons to vary 
monotonically as the atomic number increases. The actual variation, how- 
ever, is entirely different. 

In the normal state of the hydrogen atom there is only one electron, in 
the ls state. In the atom of the next element, helium, another ls electron is 
added; the binding energy of the 1s electrons in the helium atom is, however, 
considerably greater than in the hydrogen atom. This is a natural conse- 
quence of the difference between the field in which the electron moves 
in the hydrogen atom and the field encountered by an electron added to the 
Het ion. At large distances these fields are approximately the same, but 
near the nucleus with charge Z = 2 the field of the Het ion is stronger than 
that of the hydrogen nucleus with Z = 1. In the lithium atom (Z = 3), the 
third electron enters the 2s state, since no more than two electrons can be 
in Is states at the same time. For a given Z the 2s level lies above the 1s 
level; as the nuclear charge increases, both levels become lower. In the 
transition from Z = 2 to Z = 3, however, the former effect is predominant, 
and so the binding energy of the third electron in the lithium atom is con- 
siderably less than those of the electrons in the helium atom. Next, in the 
atoms from Be (Z = 4) to Ne (Z = 10), first one more 2s electron and then 
six 2p electrons are successively added. The binding energies of these 
electrons increase on the average, owing to the increasing nuclear charge. 
The next electron added, on going to the sodium atom (Z = 11), enters 
the 3s state, and the binding energy again diminishes markedly, since the 
effect of going to a higher shell predominates over that of the increase of the 
nuclear charge. 

This picture of the filling up of the electron envelope is characteristic of the 
whole sequence of elements. All the electron states can be divided into 
successively occupied groups such that, as the states of each group are occu- 
pied in a series of elements, the binding energy increases on the average, but 
when the states of the next group begin to be occupied the binding energy 
decreases noticeably. Figure 24 shows those ionization potentials of elements 
that are known from spectroscopic data; they give the binding energies of 
the electrons added as we pass from each element to the next. 

The different states are distributed as follows into successively occupied 
groups: 


Is 2 electrons 

2s, 2p 8 z 

3s, 3p O t 

4s, 3d, 4p He ay \ (73.1) 
5s, 4d, 5p ice a 

6s, 4f, 5d, 6p — 


7s, 6d, 5f... . 
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The first group is occupied in H and He; the occupation of the second and 
third groups corresponds to the first two (short) periods of the periodic 
system, containing 8 elements each. Next follow two long periods of 18 ele- 
ments each, and a long period containing the rare-earth elements and 32 
elements in all. The final group of states is not completely occupied in the 
natural (and artificial transuranic) elements. 

To understand the variation of the properties of the elements as the states 
of each group are occupied, the following property of d and f states, which 
distinguishes them from s and p states, is important. The curves of the effec- 
tive potential energy of the centrally symmetric field (composed of the electro- 
static field and the centrifugal field) for an electron in a heavy atom have a 
rapid and almost vertical drop to a deep minimum near the origin; they then 
begin to rise, and approach zero asymptotically. Fors and p states, the rising 
parts of these curves are very close together. This means that the electron 
is at approximately the same distance from the nucleus in these states. The 
curves for the d states, and particularly for the f states, on the other hand, 
pass considerably further to the left; the classically accessible region which 
they delimit ends considerably closer in than that for the s and p states 
with the same total electron energy. In other words, an electron in the d and f 
states is mainly much closer to the nucleus than in the s and $ states. 

Many properties of atoms (including the chemical properties of elements; 
see §81) depend principally on the outer regions of the electron envelopes. The 
above characteristic of the d and f states is very important in this connection. 
Thus, for instance, when the 4f states are being filled (in the rare-earth ele- 
ments; see below), the added electrons are located considerably closer to the 
nucleus than those in the states previously occupied. As a result, these 
electrons have practically no effect on the chemical properties, and all the 
rare-earth elements are chemically very similar. 

The elements containing complete d and f shells (or not containing these 
shells at all) are called elements of the principal groups; those in which the 
filling up of these states is actually in progress are called elements of the inter- 
mediate groups. ‘These groups of elements are conveniently considered separ- 
ately. 

Let us begin with the elements of the principal groups. Hydrogen and 
helium have the following normal states: 


yi: Is Si oHe : 152 1S0 


(the number with the chemical symbol always signifies the atomic number). 
The electron configurations of the remaining elements of the principal groups 
are shown in Table 3. 
In each atom, the shells shown on the right of the table in the same line 
and above are completely filled. The electron configuration in the shells 
that are being filled is shown at the top, while the principal quantum number 
of the electrons in these states is shown by the figure on the left of the table 
in the same line. The normal states of the whole atom are shown at the bot- 
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tom. Thus, the aluminium atom has the electron configuration 1s? 2s? 2p8 3s? 
3p *Pr/2. 

The values of L and S in the normal state of the atom can be determined 
(the electron configuration being known) by means of Hund’s rule (§67), and 
the value of J is determined by the rule given in §72. 


TABLE 3 


Electron configurations of the atoms of elements in the principal groups 





A 5? stp s2p1 stp? stpt stp’ stps 
n= 2 3Li 4aBe 5B 6C aN 80 9F 190Ne 1s? 

3 11Na 12Mg 132Al 1451 isP 109 17Cl 18Àr Zee 2p" 
4 9K 90Ca 36) 3p" 
4 o9Cu 302n 31Ga 32Ge 33S 3456 35Br 36Kr 3q'0 

5 Rb 385r 453 4p* 
5 a7Ag asCd 49In 599Nn 515b sole 531 54 XE 4d!1° 

6 s5sCs  seBa Ss? Sp® 
6 7Au solig grill soPb 83B1 saPo asAt s6Rn 4f'4 Sd"? 
7 Fr asRa 6s? 6p* 


*Sij So ‘Pi *Po “Sap ‘P: Psa "Sg 


The atoms of the inert gases (He, Ne, Ar, Kr, Xe, Rn) occupy a special 
position in the table: the filling up of one of the groups of states listed in 
(73.1) is completed in each of them. Their electron configuraticis have 
unusual stability (their ionization potentials are the greatest in their respective 
series). This causes the chemical inertness of these elements. 

We see that the occupation of different states occurs very regularly in the 
series of elements of the principal groups: first the s states and then the p 
states are occupied for each principal quantum number n. The electron 
configurations of the ions of these elements are also regular (until electrons 
from the d and f shells are removed in the ionization): each ion has the con- 
figuration corresponding to the preceding atom. Thus, the Mgt ion has 
the configuration of the sodium atom, and the Mg++ ion that of neon. 

Let us now turn to the elements of the intermediate groups. The filling 
up of the 3d, 4d, and 5d shells takes place in groups of elements called 
respectively the iron group, the palladium group and the platinum group. 
Table 4 gives those electron configurations and terms of the atoms in these 
groups that are known from experimental spectroscopic data. As is seen from 
this table, the d shells are filled up with considerably less regularity than the 
s and p shells in the atoms of elements of the principal groups. Here a 
characteristic feature is the “competition” between the s and d states. It is 
seen in the fact that, instead of a regular sequence of configurations of the 
type d? s? with increasing p, configurations of the type dp*!s or d?+2 are often 


found. Thus, in the iron group, the chromium atom has the configuration 
3d5 4s, and not 3d4 452; after nickel with 8 d electrons, there follows at once 


the copper atom with a completely filled d shell (and hence we place this 
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TABLE 4 


Electron configurations of the atoms of elements in the iron, palladium and 
platinum groups 





Iron group 


es 


215c 221i 23V eaCr  23Mn  o¢6Fe 27Co 2aNi 
3d 4s? 3d? 4s? 3d% 4s? 3d5 4s 3d? 4s? 3d® 45? 3d? 45? 3d® 45? 
Ar envelope + 
‘D; f2 ot 2 ‘F; /2 "Ss S5 b /2 pEr P 9/2 F å 


Palladıum group 





391 402r aiNb 42Mo a3Te a4Ru asRh a6Pd 





4d 5s? 4d? Ss? 4d 5s 4d 5s 4d55s2 4d? 5s 4d®5s 4d’ 
Kr envelope + 
Dz SF; ‘Di 7S3 Ss SF; ‘Faz "So 





Platinum group 





syLa 


5d 6s? 


Xe envelope + 2n 
3/2 





71Lu 72Hf 21a 74W 7Re 760s 27Ir 78Pt 





Ne envelope Sd 6s? Sd? 6s*? 5d? 6s? 5dt 6s? Sd° 6s? S5d€6s2 5d76s2 Sd? 6s 
4f 
TEE "Dep E 3Faz De Si. 5Da ike "Ds 





element in the principal groups). This lack of regularity is observed in the 
terms of ions also: the electron configurations of the ions do not usually 
agree with those of the preceding atoms. For instance, the V+ ion has the 
configuration 3d4 (and not 3d? 4s? like titanium); the Fet ion has 3d® 4s 
(instead of 3d54s? as in manganese). We may remark that all ions found 
naturally in crystals and solutions contain only d (not s or p) electrons in their 
incomplete shells. Thus iron is found in crystals or solutions only as the 
ions Fet++ and Fe+t++, whose configurations are 3d§ and 3d5 respectively. 

A similar situation occurs in the filling up of the 4f shell; this takes place 
in the series of elements known as the rare earths (Table 5).¢ The filling up 
of the 4f shell also occurs in a slightly irregular manner characterized by the 
competition between 4f, 5d and 6s states. 


+ 1n books on chemistry, lutetium is also usually placed with the rare-earth elements. This, 
however, is incorrect, since the 4f shell is complete in lutetium; 1t must therefore be placed 
in the platinum group, as in Table 4. 
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The last group of intermediate elements begins with actinium. In this 
group the 6d and 5f shells are filled, similarly to what happens in the group 
of rare-earth elements (Table 6). 

To conclude this section, let us examine an interesting application of the 
Thomas—Fermi method. We have seen that the electrons in the p shell first 
appear in the fifth element (boron), the d electrons for Z = 21 (scandium), 
and the f electrons for Z = 58 (cerium). These values of Z can be predicted 
by the Thomas-Fermi method, as follows. 

An electron with orbital angular momentum / in a complex atom moves 
with an ‘“‘effective potential energy’’T of 


Ur) = +i. 


The first term is the potential energy in an electric field described by the 
Thomas~—Fermi potential ¢(7). The second term is the centrifugal energy, 
in which we put (1+ 3)? instead of (+1), since the motion is quasi-ciassical. 
Since the total energy of the electron in the atom is negative, it is clear that, 
if (for given values of Z and /) U,(r) > 0 for all r, there can be no electrons 
in the atom concerned with the given value of the angular momentum l. If 
we consider any definite value of land vary Z, it is found that in fact U(r) > 0 
everywhere when Z is sufficiently small. As Z is increased, a value is reached 
for which the curve of U(r) touches the axis of abscissae, while for larger Z 
there is a region where U(r) < 0. Thus the value of Z at which electrons 
with the given Z appear in the atom is determined by the condition that the 
curve of U,(r) touches the axis of abscissae, i.e. by the equations 


U(r) = $+ =0, Ue) = pOH = 0. 


Substituting here the expression (70.6) for the potential, we obtain the 
equations 
Z?718y(x)/x = (43r) PIE), 


1 ose 
ZL [ xx’ (x)—x(x)]/x = —2(4/3r)? R(1+4})?/x. ; \ ( ) 


Dividing each side of the second equation by the corresponding side of the 
first, we find for x the equation 


x'(x)/x(x) = —1/x, 


and we then calculate Z from the first of equations (73.2). A numerical 
calculation gives 


Z = 0-155(2/+1)8. 


This formula determines the value of Z for which electrons with a given / 
first appear in the atom; the error is about 10 per cent. 


oes accurate values are obtained by taking the coefficient as 0-17 instead 
of 0-155: 


Z = 0-17(21+ 158. (73.3) 


t As in §70. we use atomic units. 
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For / = 1, 2, 3 this formula gives respectively, after rounding to the nearest 
integer, the correct values 5, 21,58. For/ = 4, formula (73.3) gives Z = 124; 
this means that g electrons should first appear only in the 124th element. 


§74. X-ray terms 


The binding energy of the inner electrons in the atom is so large that, if 
such an electron makes a transition into an outer unfilled shell (or is removed 
from the atom), the excited atom (or ion) is mechanically unstable with 
respect to ionization, which is accompanied by the reconstruction of the 
electron envelope and the formation of a stable ion. However, because of the 
comparatively weak interaction between the electrons in the atom, the prob- 
ability of such a transition is comparatively small, so that the lifetime 7 of 
the! excited state is long. Hence the “width” A/7 of the level (see §44) is so 
small that it is reasonable to regard the energies of an atom with an excited 
inner electron as discrete energy levels of ‘“‘quasi-stationary’’ states of the 
atom. These levels are called X-ray terms.+ 

The X-ray terms are primarily classified according to the shell from which 
the electron is removed, or in which, as we say, a hole is formed. Where 
the electron goes has almost no effect on the energy of the atom, and hence 
iS unimportant. 

The total angular momentum of the set of electrons occupying any shell 
is zero. When one electron has been removed, the shell acquires some 
angular momentum j. For the (n, /) shell, the angular momentum j can 
take the values /+}4. Thus we obtain levels which might be denoted by 
Is) jo, 2512, 2P1/e, 2P3/2 --, Where the value of f is added as a-suffix to the 
letter giving the position of the hole. It is usual, however, to employ special 
symbols as follows: 


lsa Zsa Pua Pa Isi IPre IPs Idy Idy 
K Li Lir Im Mo My Mm Mw M 


The levels with n = 4, 5, 6 are similarly denoted by the letters N, O, P. 
Levels with the same n (denoted by the same capital letter) lie close together 
and at a distance from levels with a different n. The reason for this is that, 
owing to the relative nearness of the inner electrons to the nucleus, they are 
in the almost unscreened field of the nucleus, and hence their states are 
hydrogen-like; the energy is, to a first approximation, — Z2/2n2 (in atomic 
units), i.e. depends only on n. If relativistic effects are taken into account, 
terms with different j are separated (cf. the discussion in §72 of the fine 
Structure of the hydrogen levels), such as, for example, Ly and Lyr from Lir, 
and Wr and Mr from Mi and Myry. These pairs of levels are said to be 
relativistic doublets. The separation of terms with different / and the same j 
(for instance L, and Ly, M, and M) is due to the deviation of the fiefd in 





t The name is due to the fact that transition 


s between these levels cause the emission of 
X-rays by the atom. 
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which the inner electrons move from the Coulomb field of the nucleus, 
i.e. to the taking into account of the interaction of the electron with other 
electrons. These are said to be screening doublets. The main correction 
term to the “hydrogen-like”’ energy of the electron results from the potential 
cue to the remaining electrons in the region near the nucleus, and is pro- 
portional to 24/3 (see (70.8)). However, since this correction does not 
depend on either n or l, it does not affect the level spacings. The principal 
correction terms in the level differences are therefore due to the interaction 
of one electron with those adjoining it. Since the distances between the inner 
electrons are r ~ 1/Z (the Bohr radius in the field of a charge Z), the energy 
of this interaction is ~ l/r ~ Z. Taking this correction into account, we can 
write the energy of an X-ray term, to the same accuracy, as —(Z—6)?/2n?2, 
where ô = 6(n,/) is a quantity smal] compared with Z, and may be regarded 
as a measure of the screening of the nuclear charge. 

Terms with two and three holes may exist in the electron shells together 
with the X-ray terms with one hole. Since the spin-orbit interaction is 
strong for the inner electrons, the holes are subject to 77 coupling. 

The width of an X-ray term is determined by the total probability of all 
possible processes by rearrangement of the electron envelope of the atom so 
as to fill the hole in question. In the heavy atoms, transitions of the hole 
from a given shell to a higher one (i.e. electron transitions in the opposite 
direction) are the most important, and are accompanied ‘by the emission of 
X-ray quanta. The probability of these “radiative” transitions, and therefore 
the corresponding part of the level width, increase very rapidly with the 
atomic number (as Z4) but decrease towards higher levels for a given Z. 

For lighter atoms (and higher levels) an important or even predominant 
part is played by radiationless transitions, in which the energy liberated when 
a hole is filled by an electron from above goes to remove another inner 
electron from the atom (called the Auger effect). As a result of this process 
the atom is in a state with two holes. The probabilities of these processes and 
the corresponding contribution to the level width are independent of the 
atomic number toa first approximation with respect to 1/ Z (see the Problem)f. 


PROBLEM 


Find the limiting law of dependence of the Auger width of X-ray terms on atomic number 
when the latter is sufficiently large. 


SoLuTion. The Auger transition probability is proportional to the square of a matrix 
element of the form 


M = | (yeta Viited id Ve, 


where yi, Yz and $i , Y, are the initial and final wave functions of the two electrons involved 
in the transition, and V = e2/ryo is their interaction energy. When Z is sufficiently large, the 
wave functions of the inner electrons may be regarded as hydrogen-like and the screening of 
the field of the nucleus by other electrons may be neglected (the wave function of the ioniza- 


+ As an example it may be mentioned that the Auger width of the K level is about 1 eV, 
and reaches values of the order of 10 eV for higher levels. 
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tion electron is also hydrogen-like in the region within the atom which is of importance in the 
integral M). If we carry out the calculations and all quantities are expressed in Coulomb 
units (with the constant « = Ze*; see §36), then the only quantity in the integral M which 
depends on Z is V = 1/Zriz, so that M ~ 1: Z. The transition probability, and therefore the 
Auger width AE of the level, are proportional to 1/ Z2. On returning to ordinary units (the 
Coulomb unit of energy being Z°me4/fi*), we find that AE is independent of Z. 


§75. Multipole moments 


In classical theory, the electrical properties of a system of particles are 
described by its multipole moments of various orders, expressed in terms of 
the charges and coordinates of the particles. In the quantum theory, the 
definitions of these quantities are the same in form, but they must now be 
regarded as operators. 

The first multipole moment is the dipole moment, defined as the vector 


. d= Ler, 


where the summation is over all the particles, and the suffix which numbers 
the particles is omitted for brevity. The matrix of this operator, like that of 
any polar vector (see §3()), has non-zero elements only for transitions between 
states of different parity. The diagonal elements are therefore always zero. 
In other words, the mean values of the dipole moment of any system of par- 
ticles (e.g. an atom) in stationary states are zero.t 

The same is evidently true of all 2'-pole moments with odd 1. The com- 
ponents of such a moment are polynomials of odd degree lin the coordinates, 
which, like the components of a polar vector, change sign on inversion of 
the coordinates. The same parity selection rule therefore applies. 

The quadrupole moment of a system is defined as the symmetrical tensor 


On = Le(3xyx,—Sizr?), (75.1) 


the sum of whose diagonal terms is zero. The determination of the values of 
these quantities in a particular state of a system (an atom, say) requires an 
averaging of the operator (75.1) over the corresponding wave function. This 
averaging is conveniently carried out in two stages (cf. §72). 

Let Qix denote the quadrupole moment operator averaged over the electron 
States with a given value of the total angular momentum / (but not of its com- 
ponent My). 

The operator thus averaged must be expressible in terms of operators of 
quantities describing the state of the atom as a whole. The only such vector 





t To avoid misunderstanding it should be emphasized that this refers to a closed system 
of particles or to a system of particles in a centrally symmetric external electric field. For 
example, if the nuclei are regarded as “‘fixed’’, the above statement is valid for the electrons 
in an atom, but not for those in a molecule. 

It is also assumed that there is no additional (“‘accidental’’} degeneracy of the energy level 
other than that with respect to directions of the total angular momentum. [ff this is not so 
wave functions of stationary states can be constructed which do not have any definite parity, 
and the corresponding diagonal elements of the dipole moment need not vanish. 
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z A A 
is the “vector” J. Thus the operator Q;;. must have the form 


opaa T ?._ 2f295.. 
Oix yor neck 277 S:x), (75.2) 


where the expression in parentheses is constructed so as to be symmetrical 
in the suffixes z and k and to vanish on contraction with respect to them; the 
significance of the coefficient Q will be explained later. The operators J; 
must here be understood as the familiar (§§27 and 54) matrices with respect 
to states having different values of My. The operator J? can, of course, be 
simply replaced by its eigenvalue /(/+1). 

Since the three components of the angular momentum J cannot simul- 
taneously have definite values, the same is true of the components of the 
tensor Qix. For the component Ozz, we have 


-O ee a 
= TA i 
In a state with given values of J? = /(J+1) and J, = My, Ọzz also has a 
definite value: 


Leo” =: 
O22 = joy” WU FO: (75.3) 


For My = J (when the angular momentum is “entirely” in the z-direction), 
we have Q2, = Q; this quantity is usually called simply the quadrupole 
moment. 

For J = 0, all the elements of the angular-momentum matrix are zero, 
and the operators (75.2) therefore also vanish. They likewise vanish identi- 
cally when J = 3. This is easily seen by direct multiplication of the Pauli 
matrices (55.7), which are the matrices of the components of any angular 
momentum equal to 4. 

This is no accident, but is a particular case of the general rule that the 
tensor of a 2!-pole moment (with even /) is non-zero only for states of the 
system with total angular momentum 


J 2h (75.4) 


The tensor of a 2!-pole moment is an irreducible tensor of rank / (see Fields, 
§41), and the condition (75.4) follows from the general angular-momentum 
selection rules for the matrix elements of such tensors — the condition for 
the diagonal matrix elements to be non-zero (§107). As already mentioned, 
the parity selection rules then require that / should be even. 

It should also be noted that the electric multipole moments are purely 
“orbital” quantities; their operators do not involve the spin operators. 
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Hence, 1f the spin-orbit interaction is negligible, so that L and S are separately 
conserved, the matrix elements of the multipole moments are subject to 
selection rules with respect to the quantum number L as well as /. 


PROBLEMS 
ProsLem 1. Find the relation between the operators of the quadrupole moment of an 
atom in states corresponding to various components of the fine structure of a level (i.e. states 
with different values of J but given values of L and S}. 
SoLUTION. [In states with given values of L and S, the operator of the quadrupole moment, 
a purely orbital quantity, depends only on the operator L, and so is given by the same 
formula (75.2) with f replaced by Ê and with a different constant Q. The operator (75.2) 
is obtained by a further averaging over the state with a given value of J: 
7 303 
Ow = ———_— Je + etc —- hI J + da] 
2f(2f—-3) 
301 


= meet Leli-3LL+ 18a}, (1) 


It is required to find the relation between the coefficients Qy and Qz. To do so, we multiply 
equation (1) on the left by Ji and on the right by Jk, sum over z and k, and take the eigen- 
values of the diagonel operators. We have 


JliJ = (J-L), 
2J L= f( J+ 1)+ KL4+1)—S(S+ 1). 


where, by (31.4), 


The product Ji Ly Ly Tk can be transformed by means of the formulae 


{Liba} = iele {Jili} = teum£m, 
as in §29, Problem; the result is 
Slab fe = (J.L}-].L. 
Similarly JdJSkfe = (FPF, 
IIS Ju =JXP-1). 
Thus we obtain from (1) the relation 


3).L(2].L—1)-2J + NL(L+1) 
i a 2 
QI= Qr O +IXZJ+IL2L-1) ` (2) 


In particular, for S = } this formula gives 





Qzj= QL for J= L+ł, 
__, (L-1X2L+3) } (3) 
Q= Pr ELi for J = L-t. 


PROBLEM 2. Express the quadrupole moment of an electron (charge —|e]) with orbital 
angular momentum / in terms of the mean square of its distance from the centre. 


SOLUTION. We have to average the expression 
Ozz = —lelr*(3 cos?9@—1) = —lelr2(3n22—1) 


Over a state with given angular momentum / and component m = l. The mean value of the 


angle factor is found immediately from the formula derived in §29, Problem (where fz must be 
replaced by 2); the result is 


2i 


= 2 
Cig llama: (4) 
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The sign of this quantity is opposite to that of the electron charge, as it should be: a 
particle moving with an angular momentum in the z-direction is mainly near the plane 
z = 0, and hence cos?ð < 4. 

For an electron with a given value of 7 = l + 3, formulae (3) give 


Oj = jelr?(2j—1)42)+ 2). (5) 


PROBLEM 3. Determine the quadrupole moment of an atom (in the ground state) in which 
all v electrons in excess of closed shells are in equivalent states with orbital angular momen- 
tum 7. 


SOLUTION. Since the total quadrupole moment of completed shells is zero, the quad- 
rupole moment operator of the atom is given by the sum 


3jejr? 


On = NTIS [ieiti + Dba |; 


taken.over the v outer electrons (here we have used formula (4)). 

Let us first suppose that v < 2} +1, i.e. at most half the places in the shell are occupied. 
Then, by Hund’s rule (§€7), the spins of all the v electrons are parallel (so that S = 4v). 
This means that the spin wave function of the atom is symmetrical, and the coordinate 
wave function therefore antisyrmmetrical, with respect to these electrons. Thus the electrons 
must all have different values of m, so that the greatest possible value of Mz (and the value 
of L, which is the same) is 


t 
L=(Mi)mx= È m=hb(2l—v4+l). 
m=[—y4] 


The required ỌQsz is the eigenvalue Qzz for Mz = L. We therefore have 


6\elr? 


I 
aaya, 2 ro 


=[-r+l1 


Or 


whence, on calculating the sum, 
2U2l—2v+1)  — 


& z 6 
i sa 


The final change from Q to Qy is effected by means of formula (2). 

The case of an atom whose outer shell is more than half filled is reduced to the previous 
one by considering holes instead of electrons: the result is therefore given by the same 
formula (6) with the opposite sign (the “‘hole”’ charge being +|e|), v being now taken not as 
the number of electrons but as the number of unoccupied places in the shell. 


§76. An atom in an electric field 


If an atom is placed in an external electric field, its energy levels are altered; 
this phenomenon is known as the Stark effect. 

In an atom placed in a uniform external electric field, we have a system 
of electrons in an axially symmetric field (the field of the nucleus together 
with the external field). The total angular momentum of the atom is there- 
fore, strictly speaking, no longer conserved; only the projection My of 
the total angular momentum J on the direction of the field is conserved. 
The states with different values of My have different energies, i.e. the electric 
field removes the degeneracy with respect to directions of the angular 
momentum. The removal is, however, incomplete: the states differing only 
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in the sign of My are degenerate as before. For an atom in a homogeneous 
external electric field is symmetrical with respect to reflection in any plane 
passing through the axis of symmetry (i.e. the axis passing through the nucleus 
in the direction of the field; we shall take this as the z-axis). Hence the states 
obtained from one another by such a reflection must have the same energy. 
On reflection in a plane passing through some axis, however, the angular 
momentum about this axis changes sign (the direction of a positive revolution 
about the axis becomes that of a negative one). 

We shall suppose that the electric field is so weak that the additional energy 
due to it is small compared with the distances between neighbouring energy 
levels of the atom, including the fine-structure intervals. Then, in order to 
calculate the displacement of the levels in the electric field, we can use the 
perturbation theory developed in §§38 and 39. Here the perturbation 
operator is the energy of the system of electrons in the homogeneous field &, 
and this 1s 

V= —d.&= —€d,, (76.1) 


where d is the dipole moment of the system. In the zeroth approximation, 
the energy levels are degenerate (with respect to directions of the total angular 
momentum); in the present case, however, this degeneracy is unimportant, 
and in applying perturbation theory we can proceed as if we were dealing 
with non-degenerate levels. This follows from the fact that, in the matrix 
of the quantity d, (as in that of the z-component of any vector), only the 
elements for transitions without change of My are not zero (see §29), and hence 
states with different values of My behave independently when perturbation 
theory is applied. 

The displacement of the energy levels is determined, in the first approxi- 
mation, by the diagonal matrix elements of the perturbation. But all the 
diagonal matrix elements of the dipole moment vanish (§75). ‘Thus the 
splitting of the levels in an electric field is a second-order effect with respect 
to the field.t 

Being quadratic in the field, the displacement AE, of the level En must be 
of the form 


AE, = — parE Ex, (76.2) 


where aix(™ is a symmetrical tensor of rank two; taking the z-axis in the 
direction of the field, we obtain 


AE, = — haz E.. (76.3) 


The tensor aj, is also the polarizability of the atom in the external field: 
taking the parameters A in the general formula (11.16) to be the components 
of the vector &;, and putting H = Hy—6jd;, we find that the mean value 


+t The hydrogen atom forms an exception; here the Stark effect is linear in the field (see 
the next section). The atoms of other elements, when in highly excited states (and therefore 
hydrogen-like; see §68), behave like hydrogen in sufficiently strong fields. 
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of the dipole moment of the atom, induced by the field, is 


dim = GAEn/|OG;. 


Substitution of (76.2) gives 


Gir) = vy Mb, (76.4) 


The polarizability must be calculated by the usual rules of perturbation 
theory. According to the second-approximation formula (38.10), we have 


, (di)nm( ae 
w= —2E Cen (76.5) 


The polarizability of the atom depends on its (unperturbed) state, and in . 
particular on the quantum number My. The latter dependence can be 
written in a general form. The values of oj,'" for various values of My 
may be regarded as the eigenvalues of the operator 


a = andik +a JiJe+ Jei- 88t). (76.6) 


This is the most general symmetrical tensor of rank 2 depending on the 
vector J (cf. §75). From (76.3) and (76.6) we have 


AEn = — 4an +2pr[ M3} JJ +1). (76.7) 


On summation over all values of My, the second term in the braces 
vanishes, so that the first term is the displacement of the “centre of gravity” 
of the split level. Moreover, according to (76.7) a level with J = 4 remains 
unsplit, in accordance with Kramers’ theorem (§60). 

If the atom is in a non-uniform external field (which varies only slightly 
over the dimensions of the atom), there can also exist a splitting effect linear 
in the field, due to the quadrupole moment of the atom. The operator of the 
quadrupole interaction between the system and the field has the form which 
corresponds to the classical expression (Fields, §42) for the quadrupole 
energy: 


age es, (76.8) 


Ox, OX, 





where ¢ is the potential of the electric field (the derivatives being understood 
to be taken at the position of the atom). 


PROBLEMS 
PROBLEM 1. Determine the Stark splitting of the different components of 4 multiplet 
level as a function of J. 


SOLUTION. ‘The problem is conveniently solved by changing the order in which the per- 
turbations are-applied; we first consider the Stark splitting of the level in the absence of fine 
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structure, and then bring in the spin-orbit interaction. Since the spin of the atom does not 
interact with the external electric field, the Stark splitting of a level with orbital angular 
momentum L is given by a formula of the same form (76.2), with a tensor di which is ex- 
pressed in terms of the operator L in the same way as Gx in (76.6) is expressed in terms of J: 


due = 28 +0 Lil t Leli- $5, L2), 


the suffixes n being everywhere omitted. When the spin-orbit interaction 1s included, the 
states of the atom must be described by the total angular momentum J. The averaging of 
the operator xix over states with a given value of the angular momentum J (but not of its 
component Ms) is formally identical with the averaging carried out in §75, Problem 1. We 
thus return to formulae (76.6), (76.7), with constants a, B which are given in terms of the 
constants a, b by 


3J.L{2J.L—1]— 2J + UL + 1) 
5 
JT +1X2J-1X2J + 3) 


«e=a,f = 


This determines the splitting as a function of J (but not of L and S, of course; these are 
characteristics of the unsplit term on which the constants a and b also depend). 


PROBLEM 2. Determine the splitting of a doublet level (spin S = 4) in an arbitrary 
(not weak) electric field. 


SoLution. If the splitting is not small in comparison with the interval between the 
components of the doublet, the perturbation from the electric field and the spin-orbit inter- 
action must be taken into account simultaneously, 1.e. the perturbation operator is the sum 


P = AS.L—-36%{0426[£ 2-4L(L+1)}} 


(cf. (72.4) and Problem 1). Omitting the constant terms which do not affect the splitting, 
we can write this operator in the form 


P =3A(S,£-4 §-£,428,£,]-b62L2 


(see (29.11)). For each given value of M = My the eigenvalues of this operator are determined 
by the roots of the secular equation formed from its matrix elements with respect to the 
states |Mrii[s> = |M F $, +4) From formulae (27.12) we find 

<(M—4, 4 VIM —, = LAM- i) bEXM - 3), 

(MEL -HVIM h - 4) = -1AM + -BEM +Y, 

LM- 3, SVIM}, -3> = JAV((L+ M+ Sy L-M+ 9). 


Thus (see §39, Problem 1) the level displacement is 
AE = —b62?M?+ V[LA{L + })% + b%(b€2 + A)M?], (1) 


where all terms which are the same for all components of the split doublet are omitted. 
This formula (with both signs of the root) applies to all levels with |M| s L—-}4. For 
|MM| = L—¿ there is only one state |MLMsY, and the displacement of the level is given 
simply by the corresponding diagonal matrix element, i.e., with the same choice of the 
additive constant as in (1), 


AE = (4.44 062(L4+4)—b6°L +4)? (2) 
This is the same as the result obtained from formula (1) with only one sign of the root. 


z i 3. Determine the quadrupole splitting of levels in an axially symmetric electric 
eld. 


t A similar problem for any field is in §103, Problem 6. 
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SOLUTION. In a field symmetrical about the z-axis we have 076, ax? = @2¢/0)2 = a, 
&¢d/0z* = —2a, the remaining second derivatives being zero. The quadrupole energy 
operator (76.8) is 

g E E M a R 
a aie rE = 4, Ae N SS 
A zz+ Qyy—2Qzz) ara 35) 


Replacing the operators by their eigenvalues, we obtain for the displacement of the levels 
Q 
AE = af J(J + 1) 3. M3). 
2](4f—-1) 
PROBLEM 4. Calculate the polarizability of the hydrogen atom in the ground state. 


SOLUTION. Owing to the spherical symmetry of the s state, the polarizability tensor is a 
scalar (aix = a6jx), for which we have, according to (76.5), 


the dipole moment of the electron is dz = ez, and Ep is the energy of the ground state. We 
define an auxiliary operator b by 


mie 
h dt’ 


where m is the electron mass. Then zog = (2m/h?)(E,—Ex)box, and 


Z = 


2ime? 2ime*, ; 
a = —— ? zokbko = ——(2b l 
72 2 KOKO T (zb)oo (1) 


To calculate this quantity, we need only know the result of applying b to the wave function 


Yolr). 
According to (9.2), 


m db isan B 
zØo = ——vy = —(Bb-—bB yuo. 
po TP z yo 


Denoting the function b Yo by blr) yo and noticing that yp satisfies the equation Hy, = Envy, 
where H = —h*A/2m+ U(r), we obtain for b(r) the differential equation 


$pAb+ Vb.Viio = izpo. 


By substituting 6 = f(r) cos 6 (where @ is the polar angle in spherical polar coordinates and 
z = y cos 6), this becomes 


rp +L Ly Mp oe, (2) 
Y r2 wg 
Its solution must satisfy the condition that fy, is finite asr > 0 and asr > œ. 
For the ground state of the hydrogen atom, yo = (1/ v nasg?) exp(—r/az), where ag = h?/me? 
is the Bohr radius. The solution of equation (2) which satisfies the condition stated is 
f = —iraglas+ $r). From formula (1) we now havet 


& = Sj cos? Ooo = E = i 
ap 3ap 2 


PROBLEM 5. Calculate the polarizability of an electron in a bound s state in a potential 
well with force range a such that ax < 1, with x = 4/(2m|E,|)/A and E, the electron binding 
energy. 


SoLUTION, From the condition ax < 1, in calculating the matrix element (zb)o9 we can 
neglect the region within the well, and use in all space the wave function 


+ This result will be derived by a different method in §77. 
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which relates to the region outside the well. (The normalization of this function also uses the 
condition ax < 1l; see §133.) Equation (2) in Problem 4 becomes 


bf nf -£ = ir 


and its solution satisfying the boundary conditions is f = —ir?/2«. A calculation using 
formula (1) gives 


a = me2/thexA, 


§77. A hydrogen atom in an electric field 


The levels of the. hydrogen atom, unlike those of other atoms, undergo 
a splitting proportional to the field (the linear Stark effect) in a uniform 
electric field. This is due to the occurrence of an accidental degeneracy in 
the hydrogen terms, whereby states with different / (for a given principal 
quantum number 7) have the same energy. The matrix elements of the dipole 
moment for transitions between these states are not zero, and hence the secular 
equation gives a non-zero displacement of the levels, even in the first approxi- 
mation.T 

For purposes of calculation it is convenient to choose the unperturbed 
wave functions so that the perturbation matrix is diagonal with respect to 
each group of mutually degenerate states. It is found that this is achieved 
by quantizing the hydrogen atom in parabolic coordinates. ‘The wave func- 
tions Ynnm Of the stationary states of the hydrogen atom in parabolic co- 
ordinates are given by formulae (37.15) and (37.16). 

The perturbation operator (the energy of an electron in the field &) is 
Ez = 4 &(€—7); the field is directed along the positive z-axis, and the force 
on the electron along the negative z-axis.[ We are interested in the matrix 
elements for transitions nym -> n'na m’, for which the energy (i.e. the prin- 
cipal quantum number 7) is unaltered. It is easy to see that, of these, only 
the diagonal matrix elements 


Í lnn ml Ez dy = 4E (ff (E — n?) Yn, nml dpdédy 
=} ff Jn, mè (Pi) fn, m (P2)(P1?— P) dpidpe (77.1) 


are non-zero (we have made the substitution £ = mp,, 7 = mp,). The matrix 
concerned is evidently diagonal with respect to the number m, while its 
diagonality with respect to the numbers 7, 7, follows from the orthogonality 
of the functions fn, m for different m and the same m (see below). The integra- 
tions over p, and pin (77.1) are separable; the integrals obtained are calcu- 





¢ In the following calculations we do not take account of the fine structure of the hydrogen 
levels. Hence the field must be, though not strong (for perturbation theory to be applicable), 


yet such that the Stark splitting is large in comparison with the fine structure. For the opposite 
case see ROT, §52, Problem. 


t In this section we use atomic units. 
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lated in §f of the Mathematical Appendices (integral (f.6)). After a simple 
calculation, we find for the corrections to the energy levels in the first approxi- 
mationt 


EY = 3&n(n,—n,), (77-2) 
or, in ordinary units, 


EQ) = n(n; —n2)\e|Eh2 me?. 


The two extreme components of the split level correspond to m = n—1, 
nz = 0 and n,=0, n,=n—1. The distance between these two extreme 
levels is, by (77.2), 


3&n(n—1), 


i.e. the total splitting of the level by the Stark effect is approximately pro- 
portional to n”. It is natural that the splitting should increase with the prin- 
cipal quantum number: the further the electrons are from the nucleus, the 
greater the dipole moment of the atom. 

The presence of the linear effect means that, in the unperturbed state, the 
atom has a dipole moment whose mean value is 


d, = —$n(n,—n,). (77.3) 


This is in accordance with the fact that, in a state determined by parabolic 
quantum numbers, the distribution of the charges in the atom is not sym- 
metrical about the plane z = 0 (see §37). Thus, for 7, > m, the electron is 
predominantly on the side of positive z, and hence the atom has a dipole 
moment opposite to the external field (the charge on the electron being 
negative). 

In the previous section we have shown that a uniform electric field cannot 
entirely remove degeneracy: there always remains a twofold degeneracy of 
states differing in the sign of the projection of the angular momentum on the 
direction of the field (in this case, states whose projected angular momenta 
are + m). However, we see from formula (77.2) that even this removal of 
the degeneracy does not occur in the linear Stark effect in hydrogen: the 
displacement of the levels (for given 7 and 71 — n) is independent of m and nə. 
A further removal of the degeneracy occurs in the second approximation; 
the calculation of this effect is the more interesting in that the linear Stark 
effect is altogether absent in states with nı = np. 

To calculate the quadratic effect, it is not convenient to use ordinary 
perturbation theory, since it would be necessary to deal with infinite sums 
of complicated form. Instead we use the following slightly modified method. 
_ Schrédinger’s equation for the hydrogen atom in a uniform electric field 
is of the form 


(SA +E+1/r—€z)p = 0. 


t This result was derived by K. Schwarzschild and P. Epstein (1916), using the old 
quantum theory, and by W. Pauli and E. Schrédinger (1926) using quantum mechanics. 
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Like the equation with @ = 0, it allows separation of the variables in 
parabolic coordinates. The same substitution (37.7) as was used in §37 
gives the two equations 


e r (i-e) pee 


A re) (e ih = —Pafa (77.4) 


B,+P, = 1, 


which differ from (37.8) by the presence of the terms in &. We shall regard 
the energy E in these equations as a parameter which has a definite value, 
and the quantities £}, f, as eigenvalues of corresponding operators; it 1s easy 
to see that these operators are self-conjugate. These quantities are deter- 
mined, by solving the equations, as functions of E and @, and then the condi- 
tion 8} + Be = | gives the energy as a function of the external field. 

For an approximate solution of equations (77.4), we regard the terms con- 
taining the field & as a small perturbation. In the zeroth approximation 
( E€ = 0), the equations have the familiar solutions 


A= Ve fn, mE), 
(77.5) 
fa = Ve fn, m(7€)s 


where the functions fn m are the same as in (37.16), and instead of the energy 
we have introduced the parameter 


e = (—2E). | (77.6) 
The corresponding values of f}, B} (from the equations (37.12), in which z 
must be replaced by 1/ e) are 


B = (m+4[m|+4)e, Bo = (ngt4]m]+4)e. (77.7) 


The functions f, with different 7 for a given ¢ are orthogonal, as are the eigen- 
functions of any self-conjugate operator; we have already used this fact above 
in discussing the linear effect. In (77.5) these functions are normalized by 
the conditions 


fArdg=1, fdn 


0 


The Corrections to f, and £, in the first approximation are determined by 
the diagonal matrix elements of the perturbation: 


B® =38 | &f2dé, B® = 26 f o¥f,Pdn. 
0 o 
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Calculation gives 
BY) = $8(6n,?+ 6n,|m| + m+ 6n,+3|m| +2)/e. 


The expression for 6," is obtained by replacing n, by m, and changing the 
sign. 

In the second approximation we have, by the general formulae of perturba- 
tion theory, 


py?) = kás Pnn 
i 16 nign, By (72) — pO) 


The integrals appearing in the matrix elements (€?), ,,. are calculated in §f 
of the Mathematical Appendices. The only non-zero elements are 


En na = (Eman = Ammim), 
(E)n nme = (E)n -en = Vial nt [mlm +m] — 1) /e*. 
The differences occurring in the denominators are 
By (m4) — By (m1) = emn’). 
As a result of the calculations we have 
B® = — (|| + 2, + 1)[4en® + 17(2| mln, + 2042+ |m +271.) + 18]/16e5; 


the expression for f,) is obtained by replacing n, by na Combining the 
expressions obtained and substituting in the relation £}, +f = 1, we have 
the equation 


en— E?n{17n®+51(n,—M2)?—9m®-+19]/168-+3En(n,—N,)/e2 = 1. 


Solving by successive approximations, we have in the second approximation 
for the energy E = — 4 e the expression 


1 g? 
E = — 4 Enn) nlnm na) — 9? + 19). (77.8) 
n 


T'he second term is the already familiar linear Stark effect, and the third is 
the required quadratic effect (G. Wentzel, I. Waller and P. Epstein 1926). 
We notice that this quantity is always negative, i.e. the terms are always 
displaced downwards by the quadratic effect. The mean value of the dipole 
moment is obtained by differentiating (77.8) with respect to the field; in the 
states with 7, = no it 1s 


d, = 4n'(17n2—Im?+19)8. (77.9) 
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Thus the polarizability of the hydrogen atom in the normal state (n= 1, 
m = 0) is 9/2 (see also §76, Problem 4). 

The absolute value of the energy of the hydrogen terms falls rapidly as the 
principal quantum number z increases, while the Stark splitting is increased. 
Hence it is of interest to examine the Stark effect for highly excited levels in 
fields so strong that the splitting they cause is comparable with the energy 
of the level itself, and perturbation theory is inapplicable.t This can be done 
by using the fact that states with large values of m are quasi-classical. 

By the substitution 


fh = xlvi, fz =xelvn (77.10) 


the equations (77.4) are brought into the form 














d? 2—] 
4 (ee -468)x = 0, 
de E 4g 
(77.11) 
m p (apen) = 0 
ar? 7 h TS 


Each of these equations, however, is the same in form as the one-dimensional 
Schrödinger’s equation, the part of the total energy of the particle being 
taken by łÆ£, and that of the potential energy by the functions 





By 
U,(é) = ae T +426, ae 
coals ae i (77.12) 
o) = he 


respectively. 

Figures 25 and 26 respectively show the approximate form of these 
functions (for m > 1). By Bohr and Sommerfeld’s quantization rule (48.2) 
we write 


és 
| VORE—UYE)} 4€ = (+H, 
Í (77.13) 


Í Je n a 





t The applicability of perturbation theory to high levels requires the perturbation to be 
small only in comparison with the energy of the level itself (the binding energy of the electron), 
and not with the intervals between the levels. Forin the quasi-classical case (which corresponds 
to highly excited states) the perturbation can be regarded as small if the force due to it is 


small in comparison with those acting on the particle in the unperturbed system; and this 
condition is equivalent to the one given above. 
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where nj, nz are integers.f These equations determine implicitly the depen- 
dence of the parameters f, and £, on E. Together with the equation £, + 8,.—1, 
they therefore give the energies of the levels when displaced by the electric 
field. The integrals in equations (77.13) can be reduced to elliptic integrals; 
these equations can be solved only numerically. 


Ui 


Ef 





Fic. 25 


The Stark effect in strong fields is complicated by another phenomenon, 
the ionization of the atom by the electric field (C. Lanczos 1931). The 


potential energy &z of an electron in the external field takes arbitrarily large 
negative values as z > — œ. Added to the potential energy of the electron 
within the atom, it has the effect that the region of possible motion for the 
electron (whose total energy E is negative) includes, besides the region 
inside the atom, the region of large distances from the nucleus in the direction 
of the anode. ‘These two regions are separated by a potential barrier, whose 
width diminishes as the field increases. However, in quantum mechanics 
there is always a certain non-zero probability that a particle will penetrate a 
potential barrier. In the case we are considering, the emergence of the elec- 
tron from the region within the atom, through the barrier, is simply the 
ionization of the atom. In weak fields the probability of this ionization is 





Fic. 26 


; at es 

+ A detailed investigation shows that a more exact result is obtained by writing m instead 
of m?—1 in the expressions for U, and U,. The integers 7, 7 are then equal to the parabolic 
quantum numbers. 
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vanishingly small. It increases exponentially with the field, however, and 
becomes considerable in fairly strong fields.t 


PROBLEMS 


PROBLEM 1. Determine the probability (per unit time) of the ionization of a hydrogen atom 
(in the ground state) in an electric field such that & < 1 (in ordinary units, & < mel5/A‘). 


SOLUTION.] In parabolic coordinates there is a potential barrier “‘along the 7 coordinate” 
(Fig. 26); the “extraction’’ of the electron from the atom im the direction z -> —o 
corresponds to its passage into the region of large n. To determine the ionization probability, 
it Is necessary to investigate the form of the wave function for large 7 (and small £; we shall 
see below that small values of £ are the important ones in the integral which determines the 
total probability current for the emerging electron). The wave function of the electron in the 
normal state (in the absence of the field) 1s 


Y = e Etja. (1) 


When the field is present, the dependence of 4 on £ in the region in which we are interested 
can be regarded as being the same as in (1), while to determine its dependence on 7 we have 
the equation 

am] ee. k= 2 

n? a ae a fx ees (2) 
where x = ny (the second equation (77.11) with E = —4, m = 0, Ba = 4). Let a be 
some value of 7 (within the barrier) such that 1 < 4, < 1/&. For > ņa the wave function 
is quasi-classical. Since, on the other hand, equation (2) has the form of the one-dimensional 
Schrédinger'’s equation, we can use formulae (50.2). Using as the boundary condition that 


y% must become the wave function (1) at 7 = 7), we obtain in the region outside the barrier 
the expression 


Th 


re (22) m (Hx £ ifo n+ tin), 
1p 2 


where 


We shall be interested only in the square |x|?. Hence the imaginary part of the exponent is 
unimportant. Denoting by n, the root of the equation p(n) = 0, we have 


h 


e-t exp[—2 Í | pl dn—7e]- (3) 


Ne 


|x|? = elpo 
m P 





t This phenomenon may serve to illustrate how a small perturbation may alter the nature 
of the energy spectrum. Even a weak field & is sufficient to create a potential barrier and 
produce a region, far from the nucleus, which is in principle accessible to the electron. As a 
result, the motion of the electron becomes, strictly speaking, infinite, and hence the energy 
Spectrum becomes continuous instead of discrete. Nevertheless, the formal solution obtained 
by the methods of perturbation theory has a physical significance: it gives the energy levels 
of states which are not quite but “‘almost"’ stationary. An atom that is in such a state at some 
initial instant remains in it for a long period of time. 

However, the series given by perturbation theory for the Stark splitting of the levels 
cannot be convergent in the strict sense, but is merely an asymptotic series: after a certain 


Point in the series (which becomes later as the perturbation is reduced in magnitude) the 
terms increase, not decrease, 


{t [n this problem we use atomic units. 
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In the coefficient of the exponential we put for 7 > 1 
pol = 4, p = 4V(En-1); 


in the exponent we must keep also the next term of the expansion of p(n): 


Speen = ap x 1-1) d i= z ] 
i eee exp| -en dem h 


where 9, = 1/6. Effecting the integration and neglecting ng compared with 1 wherever 
possible, we obtain 


-É 
|x|? = Ey = E (4) 
nË V(En—1) 


The total probability current through a plane perpendicular to the z-axis (i.e. the required 
ionization probability zz) is 
wm 


w = | |pl2e22mp dp, 


0 


where p is the cylindrical radius in this plane. For large ņ (and small €) we can put 
dp = dy/(Eq) = }V(n/E) dé. 
Substituting also for the velocity of the electron 


vz $ V[U- }+}En)] = V(En-1), 


we have 
w = | |xl?rV(6q— 1) dé, 
0 


that is, finally, 


w = (4/B)e 2A? (5) 


or, in ordinary units, 
w = (4m?je|9/Eh7) exp (—2rm*el[5/3Eh*). 


PROBLEM 2. Find the probability that an electron will be removed by an electric field from 
a potential well with short-range forces, in which the electron is in a bound s state. The 
electric field is assumed weak, in the sense that |el& < A?x3/m, where k = «/(2mlE])/h, 
E is the binding energy of the electron in the well and m is the electron mass (Yu. N. Demkov 
and G. F. Drukarev 1964). 


SOLUTION. As in Problem 1, for a weak electric field, large distances from the centre 
(xr > 1) are important. At these distances the wave function of the bound state of the electron 
in the well (without the field &’) has the asymptotic form 


4 = A VK, A 


r 





where A is a dimensionless constant depending on the specific form of the well.t In parabolic 
coordinates we have r = (+n), and in the region 7 > £ the wave function has the form 


la fvr Sea E (6) 


+ For example, if the radius a of the well is so small that ax <€ 1, then A = 1/v (27); 
see §133. 
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In the rest of this solution the units of mass, length and time will be m, 1/« and m/hx? 
respectively. 

The function (6) is a product of functions of € and n. In the presence of an electric field, 
the dependence of ù on $ may be taken to be the same as in (6) (cf. Problem 1). To determine 
its dependence on 7, we use Schrédinger’s equation in parabolic coordinates. Unlike the 
case of the Coulomb field, the rapid decrease of the field of the well has the result that this 
field may be neglected at the large distances important in the problem. The separation of 
variables in Schrédinger’s equation then gives again the equations (77.11), in which we must 


put E = —4}4, m = 0, and the separation parameters now satisfy the condition 

Bi + Be = (). 
The parameter 8, must be taken as $ (so that the dependence yb ~ e~¢/? satisfies the first 
equation (77.11)—approximately for small £&); then B, = —4, and the equation for 4 as a 


function of 7 Is 


y 1 t 
Ss | = eS ee, lee): = Ux. 
An? ( t 2n 4n2 z a)x x v4 


Solving this in the same way as (2), we have in place of (3) 


ix? = 12 
0 


Ipol , 2 exp (—2 | [play — 70), 
nop - 


with 


ain =f -3 : = a ier], 


Next, instead of (4) we find 


be res (-«- =) 
j V(én-1) hi 36] 
and finally, instead of (5), 
w = 7A26 exp (—2/36), 


or in ordinary units, 








mle|éc4* ( Zhi? «3 ) 
= ex — : 
hin 3ml eid 


PROBLEM 3, Find with exponential accuracy the probability that an electron will be removed 
from a potential well by a uniform variable electric field & = &, cos wt; it is assumed that 
the field frequency and amplitude satisfy the conditions 


hw < \E!, elfo < Aè’, 


where x = \/(2m|E]|)/h, and |E|1s the binding energy of the electron in the well (L. V. Keldysh 
1964).+ 


SOLUTION. With the conditions stated, the removal probability w is exponentially small. 
To calculate just the exponent (not the coefficient of the exponential), it 1s sufficient to 
regard the motion as one-dimensional in the direction of the field (the z-axis). 

It will be convenient to describe the electric field by a vector (not scalar) potential, 





+ This may refer, for example, to the ionization of a singly charged negative ion by an 
intense light-wave; here the potential well is created by the interaction of the electron with 


the neutral remainder of the atom, The condition Aw < |E] then ensures that the field of 
the electromagnetic wave may be treated classically. 
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Az = A = —(c& yw) sin wt. Then the Hamiltonian of the electron in the region outside the 
well] is 


m\ cs w 


H —( fps Jeléo sin wt) 


see (111.3). It does not contain z. With the dimensionless variables and parameters 





fa (a + sin or) Y 
čt Cy £ 
The boundary condition is that the solution Y(n, 7) with n —> O should be the same as the 
electron wave function in the well (with energy E = —]E]|), unperturbed by the wave: 
Poe as 7 30. (7) 


Since the problem is quasi-classical, we seek the solution (with exponential accuracy) in 
the form F = exp(zS), where S(n, 7) is the classical action. Since the Hamiltonian is in- 
dependent of the coordinate 7, the generalized momentum P= p is conserved along the 
classical path, so that 


T 


2 
S= — | He. 7)d7’'+np+A,  H(p,7) = a(t + = sir Or) R (8) 


T 
where A and 7, are constants. From the significance of the action as a function of the 
coordinates (see Mechanics, §43), we must take p as the value which brings the path to the 
specified point y at time 7, i.e. regard p as a function of ņ and 7 determined by the equation of 
motion &S/ép = constant: 


— { 9H(p,7') = 
na lun (9) 


the constant is chosen so that y = 0 for 7 = 79. Formulae (8) and (9) give the action as a 
function of the two constants 7) and A. In order to obtain a solution satisfying the condition 
(1) we must (as in finding the general integral of the Hamilton-Jacobi equation; see the 
footnote in Mechanics, §47) regard A as a function of 7), and 7, as a function of coordinate and 
time defined by 


Coyero: = 0. (10) 


lt is evident that A(z>) = 7); then, for y = O and 7 = 74, we have S = 7, i.e. S = 7 in 
accordance with the condition (7). Equation (10) then becomes 


H(p, 70) +1 = 0. (11) 


Equations (9) and (11) together determine the functions t(n, 7) and p(n, 7), and hence (after 
substitution in (8)) the wave function ¥(n, 7). 

The required probability x is proportional to the current density along the z-axis. In 
the classically accessible region, this is vz|"|?. The coordinate value at which this region 
begins is given by the point where im S ceases to increase. At that point (@im S/én), = 0, 
and since 6S/én = p, im p = 0; from (9) and (11) it then follows that re p = O also. From 
this condition we find the value of 7), and substituting p = 0 in (11) gives 


sin? 7o = —1, 
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whence 


Qro =isinh-y, y= 2 -V (2m Elw, 


2F lelo 





the fact that the time” 7, is imaginary corresponds to the classical impossibility of the process. 


Finally 
Ee z 
w ~ exp {-2 im [| sin? Qr’ dr’ +7 J} 


and 7 may be taken to have any real value; the imaginary part of the integral is unaffected. 
Calculation of the integral gives 


w ~ exp ae f= (i+ 73) sinh-! y — Dn. (12) 


The limiting forms of the function f(y) are 


fy) = iy for y <1, 
wlog2y-$ for y> l. 


The limiting value of w as y -> 0 corresponds to the probability of removal of the particle 
from the potential well by a constant field. 

Formula (12) is applicable if the exponent is large. For this we must in any case have 
hw & |El. 


CHAPTER XI 


THE DIATOMIC MOLECULE 


§78. Electron terms in the diatomic molecule 


IN the theory of molecules an important part is played by the fact that the 
masses of atomic nuclei are very large compared with those of the electrons. 
Because of this difference in mass, the rates of motion of the nuclei in the 
molecule are small in comparison with the velocities of the electrons. This 
makes it possible to regard the motion of the electrons as being about fixed 
nuclei placed at given distances from one another. On determining the 
energy levels U,„ for such a system, we find what are called the electron terms 
for the molecule. Unlike those for atoms, where the energy levels were 
certain numbers, the electron terms here are not numbers but functions of 
parameters, the distances between the nuclei in the molecule. The energy 
U,, includes also the electrostatic energy of the mutual interaction of the nu- 
clei, so that U,, is essentially the total energy of the molecule for a given 
arrangement of the fixed nuclei. 

We shall begin the study of molecules by taking the simplest type, the 
diatomic molecules, which permit the most complete theoretical investigation. 
The electron terms of the diatomic molecule are tunctions of only one 
parameter, the distance r between the nuclei. 

One of the chief principles in the classification of the atomic terms was 
the classification according to the values of the total orbital angular momen- 
tum L. In molecules, however, there is no law of conservation of the total 
orbital angular momentum of the electrons, since the electric field of several 
nuclei is not centrally symmetric. 

In diatomic molecules, however, the field has axial symmetry about an 
axis passing through the two nuclei. Hence the projection of the orbital 
angular momentum on this axis is here conserved, and we can classify the 
electron terms of the molecules according to the values of this projection. 
The absolute value of the projected orbital angular momentum along the 
axis of the molecule is customarily denoted by the letter A; it takes the values 
O,1,2,.... The terms with different values of A are denoted by the capital 
Greek letters corresponding to the Latin letters for the atomic terms with 
various L. Thus, for A = 0, 1, 2 we speak of 5, I] and A terms respectively; 
higher values of A usually need not be considered. 

Next, each electron state of the molecule is characterized by the total spin 
S of all the electrons in the molecule. If S is not zero, there is degeneracy 
of degree 2S +1 with respect to the directions of the total spin.t The number 
Rc karsi the fine structure due to relativistic interactions (see §§83 and 84 below). 
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2S-+1 is, as in atoms, called the multiplicity of the term, and is written as an 
index before the letter for the term; thus 3I denotes a term with A = 1, 
Ro 

Besides rotations through any angle about the axis, the symmetry of the 
molecule allows also a reflection in any plane passing through the axis. If 
we effect such a reflection, the energy of the molecule is unchanged. The 
state obtained from the reflection is, however, not completely identical with 
the initial state. For, on reflection in a plane passing through the axis of the 
molecule, the sign of the angular momentum (which is an axial vector) about 
this axis is changed. Thus we conclude that all electron terms with non-zero 
values of .\ are doubly degenerate: to each value of the energy, there corres- 
pond two states which differ in the direction of the projection of the orbital 
angular momentum on the axis of the molecule. In the case where A = 0 the 
state of the molecule is not changed at all on reflection, so that the X terms are 
not degenerate. The wave function of a E term can only be multiplied by a 
constant as a result of the reflection. Since a double reflection in the same 
plane is an identity transformation, this constant is +1. Thus we must dis- 
tinguish X terms whose wave functions are unaltered on reflection and those 
whose wave functions change sign. The former are denoted by =~, and the 
latter by 2. 

If the molecule consists of two similar atoms, a new symmetry appears, 
and with it an additional characteristic of the electron terms. A diatomic 
molecule with identical nuclei has a centre of symmetry at the point bisecting 
the line joining the nuclei.t (We shall take this point as the o:igin.) Hence 
the Hamiltonian is invariant with respect to a simultaneous change of sign 
of the coordinates of all the electrons in the molecule (the coordinates of the 
nuclei remaining unchanged). Since the operator of this transformationt 
also commutes with the orbital angular momentum operator, we have the 
possibility of classifying terms with a given value of A according to their 
parity: the wave functions of even (g) states are unchanged when the co- 
ordinates of the electrons change sign, while those of odd (u) states change sign. 
The suffixes v, g indicating the parity are customarily written with the letter 
for the term: IT, Ig, and so on. 

It is an empirical fact that the normal electron state in the great majority 
of chemically stable diatomic molecules is completely symmetrical: the 
electron wave function is invariant with respect to all symmetry transforma- 
tions in the molecule. The total spin S is zero too, in the great majority of 
cases, in the normal state. In other words, the ground term of the molecule 
is ‘LX, and it is !X~, if the molecule consists of two similar atoms. Exceptions 
to these rules are formed by the molecules Oz (whose normal term is 3E~,) 
and NO (normal term 211). 





t It has also a plane of symmetry perpendicularly bisecting the axis of the molecule. This 
element of symmetry need not be considered separately, however, since the existence of 
such a plane follows automatically from the existence of a centre of symmetry and of an 
axis of symmetry. 


J Not to be confused with that of inversion of the coordinates of all the particles in the 
molecule (cf. §86). 
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PROBLEM 


Effect the separation of variables in Schrédinger’s equation for the electron terms of the 
ion H~, using elliptic coordinates. 


SOLUTION. Schrodinger’s equation for an electron ın the field of two protons at rest 1s 
(using atomic units) 


The elliptic coordinates £, 7 are defined by 
E=(n¢n)/R. n= (re—n\iIR; 1<é< ow, —l<7<1, 


and the third coordinate ¢ is the angle of rotation about an axis passing through the two 
nuclei at a distance R apart. (See Mechanics, §48.) The Laplacian operator in these co- 
ordinates 1s 


d= aaa gO et eo -we | a 
Cae oE a a On) RAG 11 a) OEE 
Putting 


y= X(E)V(n)etre, 


we obtain for X and Y the equations 





s[e Tap LER? +2RE + A A 
dé dé ( E = gi 





[a a] ( 4ER% — A LN a0 
— — —— oh ~ =- — = > 
aye: A dp = 


where A is the separation parameter. 
Each electron term E(R) is described by three quantum numbers: A, and two “‘elliptical 
quantum numbers” n, n, which determine the number of zeros of the functions X(€) and 


Y(n). 


§79. The intersection of electron terms 


The electron terms in a diatomic molecule, as functions of the distance 7 
between the nuclei, can be represented graphically by plotting the energy as 
a function of r. It is of considerable interest to examine the intersection 
of the curves representing the different terms. 

Let U,(r), U,(7) be two different electron terms. If they intersect at some 
point, then the functions U, and U, will have neighbouring values near this 
point. To decide whether such an intersection can occur, it is convenient to 
put the problem as follows. Let us consider a point rọ where the functions 
U,(r), U(r) have very close but not equal values (which we denote by Ep E), 
and examine whether or not we can make U, and U, equal by displacing the 
point a short distance ôr. The energies E, and E, are eigenvalues of the 
Hamiltonian Hf, of the system of electrons in the field of the nuclei, which 
are at a distance r, from each other. If we add to the distance rọ an increment 
Sr, the Hamiltonian becomes H,+V, where P = r . 0H,/ér is a small cor- 
rection; the values of the functions U,, U, at the point r, +r can be regarded 
as eigenvalues of the new Hamiltonian. This point of view enables us to 
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determine the values of the terms U,(r), U,(r) at the point r,+6r by means of 
perturbation theory, V being regarded as a perturbation to the operator 

ği 

The ordinary method of perturbation theory is here inapplicable, however, 
since the eigenvalues £,, E, of the energy in the unperturbed problem are very 
close to each other, and their difference is in general not large compared with 
the magnitude of the perturbation ; the condition (38.9) is not fulfilled. Since, 
in the limit as the difference FE, —E, tends to zero, we have the case of degener- 
ate eigenvalues, it is natural to attempt to apply to the case of close eigenvalues 
a method similar to that developed in §39. 

Let Yp a be the eigenfunctions of the unperturbed operator H, which 
correspond to the energies E, Æ As an initial zero-order approximation we 
take, instead of ys, and p, themselves, linear combinations of them of the form 


Yb = Cy tcp. (79.1) 
Substituting this expression in the perturbed equation 
(+W = Ey, (79.2) 
we obtain 
(E, +V—E)t,+¢,(E,+V—E), = 0. 


Multiplying this equation on the left by ¥,* and Jo* in turn, and integrating, 
we have two algebraic equations: 


€(E,+V,,—£)+e¢V,, = 0, 
OV, +¢(E2+V2,—E) = 0, 


(79.3) 


Since the operator V is Hermitian, the matrix elements V1, and V22 are 
real, while Vis = Vo)*. The compatibility condition for these equations is 


| E+ Vn- E Vie 
| Vin E,+Vy.—-E 


whence 
E= HE, +E,+ Vit Va) vV [EE — E+ Vau— V2)? +| V2]°]. (79.4) 


This formula gives the required eigenvalues of the energy in the first approxi- 
mation. 

If the energy values of the two terms become equal at the point r,+6r 
(i.e. the terms intersect), this means that the two values of E given by formula 
(79.4) are the same. For this to happen, the expression under the radical in 
(79.4) must vanish. Since it is the sum of two squares, we obtain, as the 
condition for there to be points of intersection of the terms, the equations 


E,—E,t+Viy—Veg = 0, Vig = 0. (79,5) 
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However, we have at our disposal only one arbitrary parameter giving the 
perturbation VY, namely the magnitude êr of the displacement. Hence the 
two equations (79.5) cannot in general be simultaneously satisfied (we sup- 
pose that the functions p, ys, are chosen to be real, so that V, also is real). 

It may happen, however, that the matrix element V’,, vanishes identically; 
there then remains only one equation (79.5), which can be satisfied by a suit- 
able choice of ôr. This happens in all cases where the two terms considered 
are of different symmetry. By symmetry we here understand all possible forms 
of symmetry: with respect to rotations about an axis, reflections in planes, 
inversion, and also with respect to interchanges of electrons. In the diatomic 
molecule this means that we may be dealing with terms of different A, differ- 
ent parity or multiplicity, or (for & terms) 2+ and X- terms. 

The validity of this statement depends on the fact that the perturbation 
operator (like the Hamiltonian itself) commutes with all the symmetry 
operators for the molecule: the operator of the angular momentum about 
the axis, the reflection and inversion operators, and the operators of inter- 
changes of electrons. It has been shown in §§29 and 30 that, for a scalar 
quantity whose operator commutes with the angular momentum and in- 
version operators, only the matrix elements for transitions between states 
of the same angular momentum and parity are non-zero. This proof remains 
valid, in essentially the same form, for the general case of an arbitrary 
symmetry operator. We shall not pause to repeat it here, especially since 
in §97 we shall give another general proof, based on group theory. 

Thus we reach the result that, in a diatomic molecule, only terms of differ- 
ent symmetry can intersect, while the intersection of terms of like symmetry is 
impossible (E. Wigner and J. von Neumann 1929). If, as a result of some 
approximate calculation, we obtain two intersecting terms of the same 
symmetry, they are found to move apart on calculating the next approxi- 
mation, as shown by the continuous lines in Fig. 27. 

We emphasize that this result not only is true for the diatomic molecule, 
but is a general theorem of quantum mechanics; it holds for any case where 
the Hamiltonian contains some parameter and its eigenvalues are consequently 
functions of that parameter. 

In the terminology of group theory (see §96), the general condition for the 


ulr) 
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possible intersection of terms is that the terms should belong to different 
irreducible representations of the symmetry group of the Hamiltonian of 
the system.t 

In a polyatomic molecule, the electron terms are functions of not one but 
several parameters, the distances between the various nuclei. Let s be the 
number of independent distances between the nuclei; in a molecule of N(> 2) 
atoms, this number is s = 3N—6 for an arbitrary arrangement of the nuclei. 
Each term U,(7, ..., 7.) is, from the geometrical point of view, a surface in 
a space of s+1 dimensions, and we can speak of the intersections of these 
surfaces in manifolds of varying numbers of dimensions, from 0 (intersection 
in a point) to s—1. The derivation given above is wholly valid, except that 
the perturbation V is here determined not by one but by s parameters, the 
displacements 6r,,..., r, Even with two parameters, the two equations 
(79.5) can in general be satisfied. Thus we conclude that, in polyatomic 
molecules, any two terms may intersect. If the terms are of like symmetry, 
the intersection is given by the two conditions (79.5), from which it follows 
that the number of dimensions of the manifold in which the intersection 
occurs is s-—-2. If the terms are of different symmetry, on the other hand, 
there remains only one condition, and the intersection takes place in a mani- 
fold of s—1 dimensions. 

Thus for s = 2 the terms are represented by surfaces in a three-dimensional 
system of coordinates. The intersection of these surfaces occurs in lines 
(s—1 = 1) when the symmetry of the terms is different, and in points 
(s—2 = 0) when it is the same. It is easy to ascertain the form of the surfaces 
near the point of intersection in the latter case. The value of the energy near 
the points of intersection of the terms is given by formula (79.4). In this 
expression the matrix elements V, Vo, Vy. are linear functions of the dis- 
placements &r,, ôr, and hence are linear functions of the distances fi Ta 
themselves. Such an equation determines an elliptic cone, as we know from 
analytical geometry. Thus, near the points of intersection, the terms are 


represented by the surface of an arbitrarily situated double elliptic cone 
(Fig. 28, p. 304). 


§80. The relation between molecular and atomic terms 


As we increase the distance between the nuclei in a diatomic molecule, we 
have in the limit two isolated atoms (or ions). The question thus arises of 
the correspondence between the electron terms of the molecule and the states 
of the atoms obtained bv moving them apart (E. Wigner and E. Witmer 1928). 





t An apparent exception to this rule occurs for the electron terms of the H.+ ion. These 
are described by the angular momentum component A and the two elliptical quantum 
numbers Nx and n, (see $78, Problem). Since all these numbers are related to functions of 
different variables, there is in general nothing to prevent the intersection of terms E(R) 
which differ in the values of n and n, for the same A, even though such terms have the same 
symmetry with respect to rotations and reflections. In reality, however, the separability of 
variables in Schrédinger’s equation for this System means that its Hamiltonian has a higher 
symmetry than follows from its geometrical Properties; with respect to this complete group 
the symmetries of states with different values of ng and nn are of different types. 
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This relation is not one-to-one; if we bring together two atoms in given states, 
we may obtain a molecule in various electron states. 

Let us first suppose that the molecule consists of two different atoms. Let 
the isolated atoms be in states with orbital angular momenta L, L, and spins 
Syp Sa and let L > L} The projections of the angular momenta on the 
line joining the nuclei take the values M, = —L,, —L,+1,..., 2, and 
M, = —L,, —L; +1, ... , Lo The absolute value of the sum M, +M, deter- 
mines the angular momentum A obtained on bringing the atoms together. 
On combining all possible values of M, and M,, we find the following values 
for the numbers of times that we obtain the various values of A= |M,+™,]|: 


A=L,41, twice 
Lı+L—1 four times 
L,—-L, 2(2L,+ 1) times 
[,-L,-—1 2(2L,+1) times 
1 2(2L,+1) times 
0 2L,+1 times. 


Remembering that all terms with A # 0 are doubly degenerate, while 
those with A = 0 are not degenerate, we find that there will be 


Í term with A = Li+ La 

2 terms with A = L,+L:—1, 
2L,+1 terms with A = Lı— La (80.1) 
2L,+1 terms with A = Lı—L:— 1, 


eee ese = = o on 3 Tale eae 


2L,+1 terms with A = 0; 
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in all, (2L,+1)(L,+1) terms with values of A from 0 to L +L. 

The spins S,, S} of the two atoms combine to form the total spin of the 
molecule in accordance with the general rule for the addition of angular 
momenta, giving the following possible values of S: 


S = StS, S,+5,—1, wee y [5,—S,]. (80.2) 


On combining each of these values with each value of A in (80.1), we obtain 
the complete list of all possible terms in the molecule formed. 

For 2 terms there is also the question of sign. This is easily resolved by 
noticing that the wave functions of the molecule can be written, as r -> œœ, 
in the form of products (or sums of products) of the wave functions of the 
two atoms. An angular momentum A = 0 can be obtained either by adding 
two non-zero angular momenta of the atoms such that M, = —M,, or from 
Mı = M, = 0. We denote the wave functions of the first and second atoms 
by PO, Ym. For M = |M,| = |Me| 4 0, we form the symmetrized 
and antisymmetrized products 


Yt = Pup ty yOu, 


Y- = pP yyy. 


A reflection in a vertical plane (i.e. one passing through the axis of the mole- 
cule) changes the sign of the projection of the angular momentum on the 
axis, so that JM y, Wy; are changed into YD u, Yl) a respectively, and 
vice versa. The function y+ is thereby unchanged, while y~ changes sign; the 
former therefore corresponds to a È+ term and the latter toa L-term. Thus, 
for each value of M, we obtain one &+ and one È- term. Since M can take 
L, different values (M = 1,..., L,), we have in all L, Z+ terms and L, X- 
terms. 

If, on the other hand, M, = M, = 0, the wave function of the molecule 
is of the form Y% = 4%), Yo In order to ascertain the behaviour of the func- 
tion #9 on reflection in a vertical plane, we take a Coordinate system with 
its origin at the centre of the first atom, and the z-axis along the axis of the 
molecule, and we notice that a reflection in the vertical xz-plane is equivalent 
to an inversion with respect to the origin, followed by a rotation through 180° 
about the y-axis. On inversion, the function #9 is multiplied by Pı, where 
Pi = +1 is the parity of the given state of the first atom. Next, the result 
of applying to the wave function the operation of an infinitely small rotation 
(and therefore that of any finite rotation) is entirely determined by the total 
orbital angular momentum of the atom. Hence it is sufficient to consider 
the particular case of an atom having one electron, with orbital angular 
momentum / (and a z-component of the angular momentum m = 0); on 
putting L in place of / in the result, we obtain the required solution for any 
atom. The angular part of the wave function of an electron with m— 0 is, apart 


from a constant coefficient, Pi(cos 0) (see (28.8)). A rotation through 1807 
about the y-axis is the transformation x > —x, y > Y, z > —z or, in 
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spherical polar coordinates, r >r, 6 >7-—6, ¢ >n-—¢. Then cos 6 > 
— cos 6, and the function P;(cos 0) is multiplied by (— 1). 

Thus we conclude that, as a result of reflection in a vertical plane, the func- 
tion Yo is multiplied by (—1)4,P1. Similarly, %@9 is multiplied by 
(—1)4.Pe, so that the wave function % = Ooo is multiplied by 
(—1)4,.+4,P1P:2. The term is E+ or X- according as this factor is +1 or —1. 

Summarizing the results obtained, we find that, of the total number 
2L2+1 of X terms (with every possible multiplicity), L2+1 terms are 
utand Lo are X~, if (-1)4+4,P)P2 = +1, and vice versaif (—1)4,*+4,P\P2 
= —1, 

Let us now turn to a molecule consisting of similar atoms. The rules for 
the addition of the spins and orbital angular momenta of the atoms to form 
the total S and A for the molecule remain the same here as for a molecule 
composed of different atoms. The difference is that the terms may be even 
or odd. Here we must distinguish two cases, according as the combined 
atoms are in the same or different states. 

If the atoms are in different states,t the total number of possible terms is 
doubled in comparison with the number when the atoms are different. Fora 
reflection with respect to the origin (this being the point bisecting the axis of 
the molecule) results in an interchange of the states of the two atoms. Sym- 
metrizing or antisymmetrizing the wave function of the molecule with respect 
to an interchange of the states of the atoms, we obtain two terms (with the 
same A and S), of which one is even and the other odd. Thus we have al- 
together the same number of even and odd terms. 

If, on the other hand, both atoms are in the same state, the total number of 
states is the same as for a molecule with different toms. An investigation 
which we shall not give here on account of its length{ leads to the following 
results for the parity of these states. Let Nz, Nu, be the numbers of even and 
odd terms with given values of A and S. Then 

if A is odd, N; = Nz; 

if A is even and S is even (S = 0, 2, 4, ...), Ne = Nutl; 

if A is even and S is odd (S = 1, 3, 5, ...), Nu = Net. 

Finally, we must distinguish, among the E terms, between Xtand 2, Here, 

if Sis even, Net = No +1 = Ll; 

if Sis odd, Nut = No +1 = L+1, 
where L = L,= L. All the £+ terms are of parity (—1)%, and all 27 
terms are of parity (—1)§+. 

Besides the problem that we have examined of the relation between the 
molecular terms and those of the atoms obtained as r > œ, we may also 
propose the question of the relation between the molecular terms and those 
of the “composite atom” obtained as 7 — 0, i.e. when both nuclei are brought 
to a single point (for example, between the terms of the Hy molecule and those 


of the He atom). The following rules can be deduced without difficulty. 
From a term of the “composite” atom having spin S, orbital angular momen- 


t In particular, we may be discussing the combination of a neutral and an ionized atom. 
t See E. Wigner and E. Witmer, Zeitschrift für Physik 51, 859, 1928. 
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tum L and parity P, we can obtain, on moving the constituent atoms apart, 
molecular terms with spin S and angular momentum about the axis A = 
0, 1,..., L, with one term for each of these values of A. The parity of the 
molecular term is the same as the parity P of the atomic term (g for P = +1 
and u for P = —1). The molecular term with A = 0 is a == term if 
(-—1)£P = +1, anda 5E- termif(—1)£P = —1. 


PROBLEMS 


PROBLEM 1. Determine the possible terms for the molecules H}, Na, Oj, Cl, which can 
be obtained by combining atoms in the normal state. 


SOLUTION. According to the rules given above, we find the following possible terms: 
H, molecule (atoms in the +S state): 


> aD ae 
N, molecule (atoms in the 4S state): 


eaten Eth SBR AEH: 


Cl, molecule (atoms in the ?P state): 
er ergy My PT, A Cet, FZ, AM OM Mg 
O, molecule (atoms in the $P state): 
ante, a aes aS H, Aa aD A gopa S si, 3A 
BE, SE-,, a MM, A, 


uF 


The figures in front of the symbols indicate the number of terms of the type concerned, if 
this number exceeds unity. 


PROBLEM 2. The same as Problem 1, but for the molecules HCl, CO. 

SOLUTION. When unlike atoms are combined, the parity of their states is important 
also. From formula (31.6) we find that the normal states of the H. O and C atoms are even, 
while that of the Cl atom is odd (see Table 3 for the electron configurations of these atoms). 
From the rules given above, we have 

HCI molecule (atoms in the ?Sy and ?P, states): 

3E+ LA; 
CO molecule (both atoms in the 3P, state): 


2355+ L355- 2L3S5[], 13.54, 


§81. Valency 


The property of atoms of combining with one another to form molecules 
is described by means of the concept of valency. To each atom we ascribe 
a definite valency, and when atoms combine their valencies must be mutually 
satisfied, i.e. to each valency bond of an atom there must correspond a 
valency bond of another atom. For example, in the methane molecule 
CHa, the four valency bonds of the quadrivalent carbon atom are satisfied 
by the four univalent hydrogen atoms. In going on to give a physical inter- 


pretation of valency, we shall begin with the simplest example, the com- 
bination of two hydrogen atoms to form the molecule H,. 
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Let us consider two hydrogen atoms in the ground state (2S). When they 
approach, the resulting system may be in the molecular state 12+, or ?+,. 
The singlet term corresponds to an antisymmetrical spin wave function, and 
the triplet term to a symmetrical function. The coordinate wave function, 
on the other hand, is symmetrical for the 1X term and antisymmetrical for the 
32 term. It is evident that the ground term of the H, molecule can only be 
the 12 term. For an antisymmetrical wave function ¢({r,, r2) (where r, and r, 
are the radius vectors of the two electrons) always has nodes (since it vanishes 
for r, = r,), and hence cannot belong to the lowest state of the system. 

A numerical calculation shows that the electron term 1% in fact has a deep 
minimum corresponding to the formation of a stable H, molecule. In the 
8 state, the energy U(r) decreases monotonically as the distance between 
the nuclei increases, corresponding to the mutual repulsion of the two hydro- 
gen atomst (Fig. 29). 





Fic. 29 


Thus, in the ground state, the total spin of the hydrogen molecule is zero, 
S=0. It is found that the molecules of practically all chemically stable 
compounds of elements of the principal groups have this property. Among 
inorganic molecules, exceptions are formed by the diatomic molecules O, 
(ground state 3£) and NO (ground state IT) and the triatomic molecules 
NO,, ClO, (total spin S = }). Elements of the intermediate groups have 
special properties which we shall discuss below, after studying the valency 
properties of the elements of the principal groups. 

The property of atoms of combining with one another is thus related to 
their spin (W. Heitler and H. London 1927). The combination occurs in 


+ Here we ignore the van der Waals attractson forces between the atoms (see §89). The 
existence of these forces causes a minimum (at a greater distance) on the U(r) curve for the 
85 term also. This minimum, however, is very shallow in comparison with that on the *2 
curve, and would not be perceptible on the scale of Fig. 29. 
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such a way that the spins of the atoms compensate one another. As a quanti- 
tative characteristic of the mutual combining powers of atoms, it is convenient 
to use an integer, twice the spin of the atom. This is equal to the chemical 
valency of the atom. Here it must be borne in mind that the same atom may 
have different valencies according to the state it is in. 

Let us examine, from this point of view, the elements of the principal 
groups in the periodic system. The elements of the first group (the first 
column in Table 3, the group of alkali metals) have a spin S = + in the normal 
state, and accordingly their valencies are unity. An excited state with a higher 
spin can be attained only by exciting an electron from a completed shell. 
Accordingly, these states are so high that the excited atom cannot form a stable 
molecule. + 

The atoms of elements in the second group (the second column in Table 3, 
the group of alkaline-earth metals) have a spin S = 0 in the normal state. 
Hence these atoms cannot enter into chemical compounds in the normal state. 
However, comparatively close to the ground state there is an excited state 
having a configuration sp instead of s? in the incomplete shell, and a total 
spin S = 1. The valency of an atom in this state is 2, and this is the principal 
valency of the elements in the second group. 

The elements of the third group have an electron configuration s’p in the 
normal state, with a spin S = 4. However, by exciting an electron from the 
completed s-shell, an excited state is obtained having a configuration sp? and a 
spin S = 3/2, and this state lies close to the normal one. Accordingly, the 
elements of this group are both univalent and tervalent. The first two ele- 
ments in the group (boron, aluminium) behave only as tervalent elements. 
The tendency to exhibit a valency 1 increases with the atomic number, and 
thallium behaves equally as a univalent and as a tervalent element (for example, 
in the compounds TICi and TICI,). This is due to the fact that, in the first 
few elements, the binding energy in the tervalent compounds is greater than 
for the univalent compounds, and this difference exceeds the excitation energy 
of the atom. 

In the elements of the fourth group, the ground state has the configuration 
s*p* with a spin of 1, and the adjacent excited state has a configuration sp? 
with a spin 2. The valencies 2 and 4 correspond to these states. As in the 
third group, the first two elements (carbon, silicon) exhibit mainly the higher 
valency (though the compound CO, for example, forms an exception), and 
the tendency to exhibit the lower valency increases with the atomic number. 

In the atoms of the elements of the fifth group, the ground state has the 
configuration s?p? with a spin S = 3/2, so that the corresponding valency 
is three. An excited state of higher spin can be obtained only by the transi- 
tion of one of the electrons into the shell with the next higher value of the 
principal quantum number. The nearest such state has the configuration 
sp3s’ and a spin S = 5/2 (by s’ we conventionally denote here an s state of an 
electron with a principal quantum number one greater than in the state s). 


t See the end of this section for the elements copper, silver and gold. 
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Although the excitation energy of this state is comparatively high, the 
excited atom can still form a stable compound. Accordingly, the elements of 
the fifth group behave as both tervalent and quinquevalent elements (thus, 
nitrogen is tervalent in NH, and quinquevalent in HNO,). 

In the sixth group of elements, the spin is 1 in the ground state (configura- 
tion s*p*), so that the atom is bivalent. The excitation of one of the p electrons 
leads to a state s*p*s’ of spin 2, while the excitation of an s electron in addition 
gives a state sp*s’p’ of spin 3. In both excited states the atom can enter into 
stable molecules, and accordingly exhibits valencies of 4 and 6. The first 
element of the sixth group (oxygen) shows only the valency 2, while the sub- 
sequent elements show higher valencies also (thus, sulphur in H,S, SO}, SOQ, 
is respectively bivalent, quadrivalent and sexivalent). 

In the seventh group (the halogen group), the atoms are univalent in the 
ground state (configuration sp, spin S = $). They can, however, enter into 
stable compounds when they are in excited states having configurations 
s2p'is’, s2p8s'p', sp3s'p’2 with spins 3/2, 5/2, 7/2 and valencies 3, 5, 7 respec- 
tively. The first element in the group (fluorine) is always univalent, but the 
subsequent elements also exhibit the higher valencies (thus, chlorine in HCI, 
HCIO, HCIO, HCIO, is respectively univalent, tervalent, quinquevalent 
and septivalent). 

Finally, the atoms of the elements in the group of inert gases have com- 
pletely filled shells in their ground states (so that the spin S = 0), and their 
excitation energies are high. Accordingly, the valency is zero, and these 
elements are chemically inactive.t 

The following general remark should be made concerning all these discus- 
sions. The assertion that an atom enters into a molecule with a valency per- 
taining to an excited state does not mean that, on moving the atoms apart to 
large distances, we necessarily obtain an excited atom. It means only that the 
distribution of the electron density in the molecule is such that, near the 
nucleus of the atom in question, it is close to that in the isolated and excited 
atom; but the limit to which the electron distribution tends as the distance 
between the nuclei is increased may correspond to non-excited atoms. 

When atoms combine to form a molecule, the completed electron shells in 
the atoms are not much changed. The distribution of the electron density 
in the incomplete shells, on the other hand, may be considerably altered. In 
the most clearly defined cases of what is called heteropolar binding, all the 
valency electrons pass over from their own atoms to other atoms, so that we 


+ Some of them nevertheless form stable compounds with fluorine and oxygen. These 
valencies may be due to a transfer of electrons from the outermost complete shell to the 
incomplete f or d states, whose energies are comparatively near. l å 

There is also an attraction which occurs in the interaction of an inert gas atom with an 
excited atom of the same element. This is due to the doubling in the number of possible 
states obtained on bringing together two atoms, if these atoms are of the same lemenni bys 
in different states (see §80). The transition of the excitation from one atom to es ot a ay 
replaces the exchange interaction which brings about the ee valency. 4 aa epa 
He, is an example of such a molecule. The same type of bond occurs in molecular 10 
composed of two similar atoms (for instance, Hg+}. 
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may say that the molecule consists of ions with charges equal (in units of e) 
to the valency. The elements of the first group are electropositive: in hetero- 
polar compounds they lose electrons, forming positive ions. As we pass to 
the subsequent groups the electropositive character of the elements becomes 
gradually less marked and changes into electronegative character, which is 
present to the greatest extent in the elements of the seventh group. Regard- 
ing heteropolarity the same remark should be made as was made above con- 
cerning excited atoms in the molecule. Ifa molecule is heteropolar, this does 
not mean that, on moving the atoms apart, we necessarily obtain two ions. 
Thus, from the molecule CsF we should in fact obtain the ions Cs+ and F-, 
but the molecule NaF gives in the limit the neutral atoms Na and F (since 
the affinity of fluorine for an electron is greater than the ionization potential 
of caesium but less than that of sodium). 

In the opposite limiting case of what is called homopolar binding, the atoms 
in the molecule remain neutral on the average. Homopolar molecules, un- 
like heteropolar ones, have no appreciable dipole moment. The difference 
between the heteropolar and homopolar types is purely quantitative, and any 
intermediate case may occur. 

Let us now turn to the elements of the intermediate groups. Those of 
the palladium and platinum groups are very similar to the elements of the 
principal groups as regards their valency properties. The only difference is 
that, owing to the comparatively deep position of the d electrons inside the 
atom, they interact only slightly with the other atoms in the molecule. Asa 
result, ‘‘unsaturated’’ compounds, whose molecules have non-zero spin 
(though in practice not exceeding 4), are often found among the compounds 
of these elements. Each of the elements can exhibit various valencies, and 
these may differ by unity, and not only by two as with the elements of the 
principal groups (where the change in valency is due to the excitation of some 
electron whose spin is compensated, so that the spins of two electrons are 
simultaneously released). 

The elements of the rare-earth group are characterized by the presence of an 
incomplete f shell. The f electrons lie much deeper than the d electrons, 
and therefore take no part in the valency. Thus the valency of the rare- 
earth elements is determined only by the s and electrons in the incomplete 
shells.t However, it must be borne in mind that, when the atom is excited, 
f electrons may pass into s and p states, thereby increasing the valency by 
one. Hence the rare-earth elements too exhibit valencies differing by unity 
(in practice they are all tervalent and quadrivalent). 

The elements of the actinium group occupy a unique position. Actinium 
and thorium have no f electrons, and their valencies involve d electrons. 
In their chemical properties they are therefore analogous to elements of the 
palladium and platinum groups, not to the rare earths. The uranium atom in 
the normal state contains f electrons, but in its compounds it too has no f 


+ Thed electrons which are found in the incomplete shells of the atoms of some rare-earth 
elements are unimportant, since these atoms in practice always form compounds in excited 
states where there are no d electrons. à 
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electrons. Finally, the atoms of the elements neptunium, plutonium, 
americium and curium contain f electrons in compounds also, but the electrons 
which participate in their valencies are again s and d electrons. In this sense 
they are homologues of uranium. The maximum possible number of ‘“‘un- 
paired” s and d electrons is one and five respectively, and so the maximum 
valency of elements in the actinium group is six, whereas the maximum 
valency of the rare-earth elements (with s and p electrons participating in 
the valency) is 1+3 = 4. 

The elements of the iron group occupy, as regards their valency properties, 
a position intermediate between the rare-earth elements and those of the palla- 
dium and platinum groups. In their atoms, the d electrons lie comparatively 
deep, and in many compounds take no part in the valency bonds. In these 
compounds, therefore, the elements of the tron group behave like rare-earth 
elements. Such compounds include those of ionic type (for instance FeCl, 
FeCl,), in which the metal atom enters as a simple cation. Like the rare- 
earth elements, the elements of the iron group can show very various valencies 
in these compounds. 

Another type of compound of the iron-group elements is formed by what 
are called complex compounds. ‘These are characterized by the fact that the 
atom of the intermediate element enters into the molecule not as a simple 
ion, but as part of a complex ion (for instance the ion MnO,- in KMn0O,, or 
the ion Fe(CN),4- in K,Fe(CN),). In these complex ions, the atoms are 
closer together than in simple ionic compounds, and in them the d electrons 
take part in the valency bond. Accordingly, the elements of the iron group 
behave in complex compounds like those of the palladium and platinum groups. 

Finally, it must be mentioned that the elements copper, silver and gold, 
which in §73 we placed among the principal groups, behave as intermediate 
elements in some of their compounds. These elements can exhibit valencies 
of more than one, on account of a transition of an electron from a d shell toa 
p shell of nearly the same energy (for example, from 3d to 4p in copper). In 
such compounds the atoms have an incomplete d shell, and hence behave as 
intermediate elements: copper like the elements of the iron group, and silver 
and gold like those of the palladium and platinum groups. 


PROBLEM 


Determine the electron terms of the molecular ion Het obtained when a hydrogen atom in 
the normal state combines with an H+ ion, for distances R between the nuclei large compared 
with the Bohr radius (L. Landau 1961; C. Herring 1961).+ 


SOLUTION. This problem is analogous in form to §50, Problem 3: instead of two 
one-dimensional potential wells we have here two three-dimensional wells (round the two 
nuclei) with axial symmetry about the line joining the nuclei. The level J Ey = —}4 (the ground 
level of the hydrogen atom) is split into two levels U,(R) and U,(R) (the terms #2, and 


+ For the corresponding problem of the H, molecule, see L. P. Gor’kov and L. P. Pitaevskil, 
Soviet Physics Doklady 8, 788, 1964; C. Herring and M. F licker, Physical Review 134, A362, 
1964 (the second of these papers corrects an error of calculation in the first). 

t Here we are using atomic units. 
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224+), corresponding to the electron wave functions 
1 
by.ul%y,2) = goers?) + do(—~,y,2)], 


which are symmetrical and antisymmetrical about the plane x = 0 which bisects the line 
joining the nuclei (which are at (+4, 0,0)). Here Yo(x, y, z) is the wave function of the 
electron in one of the potential wells. Exactly as in §50, Problem 3, we find 


ĉ 
Up.u(R)— Eo = F { Í Taa dy dz, (1) 
X 
where the integration is over the plane x = 0.T 
The function yo (corresponding to motion around nucleus 1, say, at x = 4R) is sought 
in the form 





a , (2) 
TT 


where a is a slowly varying function (for a hydrogen atom, a = 1). The function yo must 
satisfy Schrédinger’s equation 


tout (-b- to +—\- 0, (3) 


where 71, 72 are the distances of the electron from nuclei 1 and 2. In this equation the total 
energy of the electron is Eo—1/R, since Eo itself includes the energy 1/R of the Coulomb 
repulsion of the nuclei. 

Since the function yo decreases rapidly away from the x-axis, only the region where y 
and z are small compared with R is important in the integral (1). For y, z < R, substitution 
of (2) in (3) gives 


da a a 


éx $R+x R 





here we have neglected the second derivatives of the slowly varying function a and put 
r, œ~ 4$4R+x. The solution of this equation which becomes unity as x >4R (i.e. in the 
neighbourhood of nucleus 1) 1s 





2R ( x } 
a= exp| — —4]}. 
R+2x s R ) 
Formula (1) now gives 


Us,u—£o = +— Í e-r | 27, dri 
TE 
R/2 


= $2Re-R-!, 


The amount of the splitting ist 


U,—U = —4Re-R-!, (4) 


At sufficiently large distances this expression decreases exponentially and becomes less 
than the effect in the second approximation with respect to the dipole interaction of the H 
atom and the H~ ion. Since the polarizability of the hydrogen atom in the normal state 1s 
9/2 (see (77.9)), and the field of the H* ion is £ = 1/R?, the corresponding interaction energy 


t Note that the effect sought 1s therefore determined by the range of distances at which the 
electron interacts in the same way with both nuclei 


t The corresponding result for the H; molecule, according to the papers quoted above, is 


Ug-— CL = = 1-64R5/2¢ -2R. 
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is —9/4R4, and when this is taken into account we have 


2 9 
Uy,u(R)— Eo = + = 4R (5) 


The second term becomes comparable with the first when R = 10-8. It may also be noted 
that the term U, has a minimum of —5-8 x 10-5 atomic unit (—1-6 x 10-3 eV) when R = 12-6.t 


§82. Vibrational and rotational structures of singlet terms in the 
diatomic molecule 


As has been pointed out at the beginning of this chapter, the great differ- 
ence in the masses of the nuclei and the electrons makes it possible to divide 
the problem of determining the energy levels of a molecule into two parts. 
We first determine the energy levels of the system of electrons, for nuclei at 
rest, as functions of the distance between the nuclei (the electron terms). 
We can then consider the motion of the nuclei for a given electron state; this 
amounts to regarding the nuclei as particles interacting with one another in 
accordance with the law U,(r), where Up is the corresponding electron term. 
The motion of the molecule is composed of its translational displacement as a 
whole, together with the motion of the nuclei about their centre of mass. The 
translational motion is, of course, without interest, and we can regard the 
centre of mass as fixed. 

For convenience of discussion, let us first consider the electron terms in 
which the total spin S of the molecule is zero (the singlet terms). The 
problem of the relative motion of two particles (the nuclei) which interact 
according to the law U(r) reduces to that of the motion of a single particle 
of mass M (the reduced mass of the two particles) in a centrally symmetric 
field U(r). By U(r) we mean the energy of the electron term considered. 
The problem of motion in a centrally symmetric field U(r), however, 
reduces in turn to that of a one-dimensional motion in a field where the 
effective energy is equal to the sum of U(r) and the centrifugal energy. 

We denote by K the total angular momentum of the molecule, composed 
of the orbital angular momentum L of the electrons and the angular momen- 
tum of the rotation of the nuclei. Then the operator of the centrifugal energy 
of the nuclei is 


B(r(K-L), 
where we have introduced the notation 
B(r) = h2/2Mr® (82.1) 


which is customary in the theory of diatomic molecules. Averaging this 
quantity over the electron state (for a given 7), we obtain the centrifugal 


+ This minimum, which is due to van der Waals forces, is very shallow compared with 
that of the term U,(R) which corresponds to the normal state of the stable ion H,+: the 
latter minimum is —0-60 atomic unit (—16-3 eV), at R = 2-0. 
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energy as a function of 7, which must appear in the effective potential energy 


U,(r). Thus 
Ux(r) = U(r)+B7)(K—-L), (82.2) 


where the line denotes the average mentioned. 

Let us carry out the averaging for a state in which the molecule has a 
definite value of the square of the total angular momentum K? = K(K+1) 
(where K is integral) and a definite value of the component of the electron 
angular momentum along the axis of the molecule (the z-axis) L; = A. 
Expanding the parenthesis in (82.2), we have 


U(r) = U(r)+ B(r)K(K + 1) —2B(7) L. K+ B(r)L2. (82.3) 


The last term depends only on the electron state and does not contain the 
quantum number K; it may be simply included in the energy U(r). We shall 
show that the same is true of the term preceding it. 

If the component of the angular momentum along an axis has a definite 
value, the mean value of the angular momentum vector is also along that axis; 
see the end of §27. If n denotes a unit vector along the z-axis, we therefore 
have L = An. In classical mechanics, the angular momentum of rotation of a 
system of two particles, such as nuclei, is r x p, where r = rn is the radius 
vector between the two particles and p the momentum of their relative 
motion. This quantity is perpendicular to n. In quantum mechanics, 
the same will be true of the operator of the angular momentum of rotation 
of the nuclei: (K—L).n = 0, or K.n = L.n. Since the operators are equal, 
so of course are their eigenvalues, and, since n.L = L; = A, we have 


K.n=A. (82.4) 


Thus, in the term before the last in (82.3), L.K = n.KA = A2, and is 
independent of K. Redefining the function U(r), we can finally write the 
effective potential energy as 


Ux(r) = U(r)+ B(r)K(K + 1). (82.5) 


From the equation K, = .\ it follows that, for a given value of .\, the 
quantum number K can take only values 


K >A. (82.6) 


On solving the one-dimensional Schrödinger’s equation with the potential 
energy (82.5), we obtain a series of energy levels. We arbitrarily number these 
levels (for each given K) in order of increasing energy, using a number 
v = 0, 1, 2, ... ; v = 0 corresponds to the lowest level. Thus the motion 
of the nuclei causes a splitting of each electron term into a series of levels 
characterized by the values of the two quantum numbers K and v. 
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The number of these levels (for a given electron term) may be either finite 
or infinite. If the electron state is such that, as 7 > œ, the molecule becomes 
two isolated neutral atoms, then as r > œ the potential energy U(r) (and 
therefore Ug(r)) tends to a constant limiting value U(co) (the sum of the 
energies of the two isolated atoms) more rapidly than 1/r tends to zero (see 
§89). The number of levels in such a field is finite (see §18), though in actual 
molecules it is very large. The levels are so distributed that, for any given 
value of K, there is a definite number of levels (with different values of v), 
while the number of levels with the same K diminishes as K increases, until 
a value of K is reached for which there are no levels at all. 

If, on the other hand, as 7 > œ the molecule disintegrates into two ions, 
at large distances U(r)— U(co) becomes the energy of the attraction of the 
ions according to Coulomb’s law (~ 1/r). In such a field there is an infinite 
number of levels, which become closer and closer as we approach the limiting 
value U(co). We may remark that, for the majority of molecules, the previous 
case is found in the normal state; only a comparatively small number of mole- 
cules become pairs of ions when their nuclei are moved apart. 

The dependence of the energy levels on the quantum numbers cannot be 
completely calculated in a general form. Such a calculation is possible only 
for low excited levels which lie not too far above the ground level.t Small 
values of the quantum numbers K and v correspond to these levels. It is 
with such levels that we are in fact usually concerned in the study of molecular 
spectra, and hence they are of particular interest. 

The motion of the nuclei in slightly excited states can be regarded as small 
vibrations about the equilibrium position. Accordingly we can expand U(r) 
in a series of powers of € = r—r,, where r, is the value o; r for which U(r) 
has a minimum. Since U'(r,) = 0, we have as far as terms of the second 
order 


U(r) = Ust MoE, 


where Ue = U(re), and we is the frequency of the vibrations. 

In the second term in (82.5)—the centrifugal energy—it is sufficient to 
put r = 7,, since it already contains the small quantity K(K +1). Thus we 
have 


Ur(r) = U,+B,K(K+1)+3Mw,’6, (82.7) 


where B, = h?/2Mr,? = h?/2I is what is called the rotational constant 
(J = Mre is the moment of inertia of the molecule). 

The first two terms in (82.7) are constants, while the third corresponds 
to a one-dimensional harmonic-oscillator. Hence we can at once write down 
the required energy levels: 


E = U,+B,K(K+1)+hw,(v+4). (82.8) 


+ We refer always to levels belonging to the same electron term. 
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Thus, in the approximation considered, the energy levels are composed of 
three independent parts: 


E = Ea tE, +E, (82.9) 


The first term, Ea = Uae is the electron energy (including the energy of the 
Coulomb interaction of the nuclei forr = re); the second term is 


E, = B,K(K+1) (82.10) 
the rotational energy from the rotation of the molecule,f and the third term 


E, = hw,(v+}) (82.11) 


is the energy of the vibrations of the nuclei within the molecule. The number 
v denumerates, by definition, the levels with a given K in order of increasing 
energy; it is called the vibrational quantum number. 

For a given form of the potential energy curve U(r), the frequency w, is 
inversely proportional to VM. Hence the intervals AE, between the 
vibrational levels are proportional to 1/1/44. The intervals AF, between 
the rotational levels contain in the denominator the moment of inertia Z, and 
are therefore proportional to 1/M. The intervals AE „ between the electron 
levels, however, are independent of M, like the levels themselves. Since m/M 
(m being the electron mass) is a small parameter in the theory of diatomic 
molecules, we see that 


AE,, > AE, > AE, (82.12) 


This shows the rather unusual distribution of the energy levels of the mole- 
cule. The vibrational motion of the nuclei splits the electron terms into 
levels lying comparatively close together. These levels, in turn, exhibit a fine 
splitting due to the rotational motion of the molecule.] 

In subsequent approximations, the separation of the energy into indepen- 
dent vibrational and rotational parts is impossible; rotational-vibrational 
terms appear, which contain both K and v. On calculating the successive 


t The wave function describing the rotation of a diatomic molecule (without spin) is 
essentially the same as that of a symmetrical top (§103). Unlike the top, the molecule has a 
rotation described by only two angles (x = ¢, B = 8), which define the direction of tts 
axis. ‘The rotational wave function differs from (103.8) by the absence of the factor e'*’/4/(27) 
and in the notation for the quantum numbers. Since, by (82.4), the number A is equal to the 
component of the total angular momentum K along the axis of the molecule (the ¢-axis in 
$103), we must replace J, W and k by K, M and A (here M = Kz). Thus 





beol 8) = ik TEE Danile, 8, 0) 


J As an example, we give the values of Us, fiwe and Be (in electron-volts) for a few 
molecules: 


Ha Na 0; 
m +-7 TAS 52 
hwe 0-54 0-29 0-20 
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approximations, we should obtain the levels F as an expansion in powers of the 
quantum numbers K and v. 

We shall calculate here the next approximation after (82.8). To do this, 
we must continue the expansion of U(r) in powers of £ up to terms of the 
fourth order (cf. the problem of an anharmonic oscillator in §38). Similarly. 
the expansion of the centrifugal energy is extended as far as the terms in £. 
We then obtain 

Ux(r) = U,+}Mo2é?+ (}?/2Mr2)K(K+1)— 


— at? bé4— (h?/Mr3)K( K+ 1)€+ (3h?/2Mr,{)K(K-+ 1)é. (82.13) 


Let us now calculate the correction to the eigenvalues (82.8), regarding the 
last four terms in (82.13) as the perturbation operator. Here it is sufficient, 
for the terms in ¿2 and £$, to take the first approximation of perturbation 
theory, but for those in é and €3 we must calculate the second approximation, 
since the diagonal matrix elements of € and £3 vanish identically. All the 
matrix elements needed for the calculation are derived in §23 and in §38, 
Problem 3. As a result, we obtain an expression which is usually written in 
the form 


E = Ey, +hw,(v+4)—x,fiw,(vt+4h)?+B,K(K+1)—D,K{K+1)?, (82.14) 
where 
By = B,—2,(v+4) = By—a,v- (82.15) 
The constants x,, Beo «,, D, are related to the constants appearing in (82.13) 
by 
B, = he/27, D, = 4Bé/FuZ, 


6B? / aħ 2 3 hones we 
ni 
iw, \Mw?2N MB, 2hw,\Mw,/ L2 Mw? 


(82.16) 

















The terms independent of v and K are included in E, 


PROBLEM 


Determine the accuracy of the approximation which gives the separation of the electron 
and nuclear motions in a diatomic molecule. 3 

SOLUTION. The total Hamiltonian of the molecule may be written H = 7,+He1, where 
T, = §?/2M is the operator of the kinetic energy of the relative motion of the nuclei 
( = —1ih@/er; r is the radius vector joining the nuclei; M is the reduced mass). The 
Hamiltonian Hi.) includes the operators of the kinetic energy of the electrons, the potential 
energy of their Coulomb interaction with one another and with the nuclei, and the energy 
of the Coulomb interaction of the nuclei.t The solution of Schrédinger’s equation 


Ay = (f+ Bu) = Ey (1) 


is sought in the form 
Y = È xm(t)bmlg. 7), (2) 


+ The Hamiltonian Ê relates to a frame of reference in which the centre of mass of the 
whole molecule is at rest (Pan +Pe = 0, where Pr is the total momentum of the two nuclei, and 
P, that of the electrons). We do not, however, include in it the term corresponding to the 
kinetic energy of the motion of the centre of mass P,,7/2(M, + Mo) = £,?/2(M1+Me). This 
term is certainly small, in the ratio m/M, compared with the kinetic energy of the electrons. 
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where the functions ¢m(q, r) are orthonormalized solutions of the equation 
Babna, r) = Un(r)bnlg r), (3) 


q denoting the set of electron coordinates; Um(r) are the eigenvalues of the Hamiltonian Aa, 
which depend on r as a parameter. Substituting (2) in (1), multiplying by ¢n*(g, r) and 
integrating over q, we obtain 


As 


ae Vnan + U(r) -E |ia) ae x tar ag V'nm)Xn(¥), (+) 


where 


Í A ree 1 ? 
Yan = RT ii -p, V'am = zy nm 


and pam = f bn*B bm dq; (p*)nm are the matrix elements with respect to the electron wave 
functions; the diagonal element prn is zero, by symmetry. 

The electron functions ¢, vary appreciably only over atomic distances, and their differentia- 
tion with respect to r therefore does not introduce the large parameter M/m (where m is the 
electron mass). The quantity Vn, is consequently small in comparison with U,(r), in the 
ratio m//M, and may be omitted. If the terms on the right of (4) are regarded as a small 
perturbation, then in the zero-order approximation the functions yn(R) are given by the 
solutions of the equation 


aT i Uni) xuv = Enexne, (5) 


which describes the motion of the nuclei in the field Un(r) (the v being the quantum numbers 
for this motion). The condition for perturbation theory to be applicable is 


(Kae | ai + V” nmi me>| < [Env es Em. 


The right-hand side of this inequality is the difference of the energies of different electron 
terms; these quantities are of order zero with respect to the smallness parameter m/M. The 
left-hand side contains the matrix elements with respect to the nuclear wave functions. The 
term in Vy; contains the factor m/M and is certainly small. In the matrix element of V’nm, 
the operator p, acting on the function xmv, multiplies it by a quantity of the order of the 
momentum of the nuclei. If the nuclei execute small oscillations, their momentum 
~ V(Mhwe); since the frequency we is itself inversely proportional to VM, the matrix 
element (nv’|V’nn.lmv> is of the order (m; M)™ +. 


§83. Multiplet terms. Case a 


Let us now turn to the question of the classification of molecular levels with 
non-zero spin S. In the zero-order approximation, when relativistic effects 
are entirely neglected, the energy of the molecule, like that of any system of 
particles, is independent of the direction of the spin (the spin is “‘free’’), and 
this results in a (2S+1)-fold degeneracy of the levels. When relativistic 
effects are taken into account, however, the degenerate levels are split, and 
the energy consequently becomes a function of the projection of the spin on 
the axis of the molecule. We shall refer to relativistic interactions in mole- 
cules as the spin—axis interaction. The chief part in this ts played (as in the 


case of atoms) by the interaction of the spins with the orbital motion of the 
electrons.f 


t Besides the spin-orbit and spin-spin interactions there is also an interaction of the spin 
and orbital motion of the electrons with the rotation of the molecule. This part of the inter- 


action is very small, however, and it is of possible interest only for terms with spin S = } 
{see §84). 
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The nature and classification of molecular levels depend markedly on the 
relative parts played by the interaction of the spin with the orbital motion, 
on the one hand, and the rotation of the molecule, on the other. The part 
played by the latter is characterized by the distances between adjacent rota- 
tional levels. Accordingly, we have to consider two limiting cases. In one, 
the energy of the spin—axis interaction is large compared with the energy 
differences between the rotational levels, while in the other it is small. ‘The 
first case is usually called case (or coupling type) a, and the second is called 
case b (F. Hund 1933). 

Case a is the one most often found. An exception is formed by the & 
terms, where case b chiefly occurs, since the effect of the spin—axis interaction 
is very small for these termst (see below). For other terms, case b is some- 
times found in the lightest molecules, since the spin—axis interaction is here 
comparatively weak, while the distances between the rotational levels are 
large (the moment of inertia being small). 

Of course, cases intermediate between a and b are also possible. It must 
also be borne in mind that the same electron state may pass continuously 
from case a to case b as the rotational quantum number changes. This is due 
to the fact that the distances between adjacent rotational levels increase with 
the rotational quantum number, and hence, when this is large, the distances 
may become large compared with the energy of the spin—axis coupling (case 
b), even if case a is found for the lower rotational levels. 

In case a, the classification of the levels is in principle little different from 
that of the terms with zero spin. We first consider. the electron terms for 
nuclei at rest, i.e. we neglect rotation entirely; besides the projection A of the 
orbital angular momentum of the electrons, we must now take into account the 
projection of the total spin on the axis of the molecule. This projection is 
denoted byt =; it takes the values S, S—1,.... — S. We arbitrarily regard 
x as positive when the projection of the spin is in the same direction as 
that of the orbital angular momentum about the axis (we recall that A denotes 
the absolute value of the latter). The quantities A and & combine to give 
the total angular momentum of the electrons about the axis of the molecule: 


Q =A+E; (83.1) 


this takes the values A+S, A+S—1,...,A—S. Thus the electron term 
with orbital angular momentum A is split into 2S+1 terms with different 
values of Q; this splitting, as with atomic terms, is called the fine structure 
or multiplet splitting of the electron levels. The value of Q is usually indicated 
as a suffix to the symbol for the term: thus, for A = 1, S = 3 we obtain the 
terms "II; /2, *I 15/9. 

When the motion of the nuclei is taken into account, vibrational and rota- 
tional structures appear in each of these terms. The various rotational levels 
are characterized by the values of the quantum number J, which gives the 


+ A special case is the normal electron term of the molecule O, (the term *Z). For this we 
have a type of coupling intermediate between a and b (see §84, Problem 3). 
t Not to be confused with the symbol for terms with A = 0. 
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total angular momentum of the molecule, including the orbital and spin 
angular momenta of the electrons and the angular momentum of the rotation 
of the nuclei.t This number takes all integral values from |Q| upwards: 


J >İ\Ql, (83.2) 


which is an obvious generalization of (82.6). 

Let us now derive quantitative formulae to determine the molecular levels 
in case a. First of all, we consider the fine structure of an electron term. 
In discussing the fine structure of atomic terms in §72, we used formula 
(72.4), according to which the mean value of the spin-orbit interaction is 
proportional to the projection of the total spin of the atom on the orbital 
angular momentum vector. Similarly, the spin—axis interaction in a diatomic 
molecule (averaged over electron states for a given distance r between the 
nuclei) is proportional to the projection È of the total spin of the molecule on 
its axis, So that we can write the split electron term in the form 


U(r)+A(r) 5, 


where U(r) is the energy of the original (unsplit) term, and A(r) is some func- 
tion of r; this function depends on the original term (and in particular on A), 
but not on È. Since one usually uses the quantum number Q and not F, 
it is more convenient to put AQ in place of AZ; these expressions differ by 
AA, which can be included in U(r). Thus we have for an electron term the 


expression 
U(r)+ A(r)Q. (83.3) 


We may notice that the components of the split term are equidistant from 
one another: the distance between adjacent components (with values of Q 
differing by unity) is A(r), independent of Q. 

It is easy to see from general considerations that the value of A for © terms 
is zero. To show this, we perform the operation of changing the sign of the 
time. The energy must then remain unchanged, but the state of the mole- 
cule changes in that the direction of the orbital and spin angular momenta 
about the axis is reversed. In the energy A(r)~, the sign of X is changed, and 
if the energy remains unchanged A(r) must change sion. If A #0, we can 
draw no conclusions regarding the value of A(r), since this depends on the 
orbital angular momentum, which itself changes sign. If A = 0, however, 
we'can say that A(r) is certainly unchanged, and consequently it must vanish 
identically. Thus, for the © terms, the spin—orbit interaction causes no split- 
ting in the first approximation; splitting (proportional to £2) would occur 
only on taking account of this interaction in the second approximation or the 
spin-spin interaction in the first approximation, and would be relatively 


small. This is the reason for the fact, already mentioned, that case b usually 
occurs for & terms. 


t The notatior. K ts, as usual, reserved for the total 
without allowance for its spin. In case a there is no qu 
momentum K is not even approximately conserved. 


angular momentum of the molecule 
antum number K, since the angular 
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When the multiplet splitting has been determined, we can take account of 
the rotation of the molecule as a perturbation, just as in the derivation given 
at the beginning of §82. The angular momentum of the rotation of the nuclei 
is obtained from the total angular momentum by subtracting the orbital 
angular momentum and spin of the electrons. Hence the operator of the 
centrifugal energy now has the form 


Bir J—L—S)?. 


Averaging this quantity with respect to the electron state and adding to 
(83.3), we obtain the required effective potential energy U(r): 


Us(r) = Ur) + A()Q+B(r\(J—L—S)? 
= U(r) +A(r)Q4 B(r)[J®—2J .(L+$)+L?2+21L . S+ 82]. 
The eigenvalue of J? is J(J +1). Next, by the same argument as in §82, we 


have 


Loan Pan (83.4) 
and also (J/-L—S).n = 0, whence we have for the eigenvalues 
ee ere een oy (83.5) 
Substituting these values, we find 


Uzr) = Ur) + A(O+B(NLJ(J41)—2024-L?4+2L . $+ 82]. 


The averaging with respect to the electron state is effected by means of the 
wave functions of the zero-ordert approximation. In this approximation, 
however, the magnitude of the spin is conserved, and hence S$? = S(S +1). 
The wave function is the product of the spin and coordinate functions ; hence 
the averaging of the angular momenta L and S takes place independently, and 
we obtain 


Barnes =A: 


Finally, the mean value of the squared orbital angular momentum L? is 
independent of the spin, and is some function of r characterizing the given 
(unsplit) electron term. All the terms which are functions of r but indepen- 
dent of J and È can be included in U(r), while the term proportional to 2 
(or, what is the same thing, to Q) can be included in the expression A(r)Q. 
Thus we have for the effective potential energy the formula 


Uz) = UHABA JJ +1)—20']. (83.6) 


+ That is, the zero-order approximation with respect to both the effect of the rotation of 
the molecule and the spin—axis interaction. 
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The energy levels of the molecule can be obtained from this by the same 
method as in §82 when using the formula (82.5). Expanding U(r) and A(r) 
in series of powers of £, and retaining the terms up to and including the 
second order in the expansion of U(r), but only the terms of zero order in the 
second and third terms, we obtain the energy levels in the form 


E = UV, +4 Q+ holot} +B JJ +1)—20?], (83.7) 


where Ae = A(re) and Be are constants characterizing the given (unsplit) 
electron term. On continuing the expansion to higher terms, we obtain a 
series of terms in higher powers of the quantum numbers, but we shall not 
pause to write these out here. 


§84. Multiplet terms. Case b 


Let us now turn to case b. Here the effect of the rotation of the molecule 
predominates over the multiplet splitting. Hence we must first consider the 
effect of rotation, neglecting the spin-axis interaction, and then the latter 
must be taken into account as a perturbation. 

In a molecule with “free” spin, not only the total angular momentum J 
but also the sum K of the orbital angular momentum of the electrons and the 
angular momentum of the nuclei are conserved; the latter is related to J by 


J =K+S. (84.1) 


The quantum number K distinguishes different states of a rotating molecule 
with free spin that are obtained from a given electron term. The effective 
potential energy Ug(r) in a state with a given value of K is evidently deter- 
mined by the same formula (82.5) as for terms with S = 0: 


Ux(r) = U(r)+B(r)K(K+1), (84.2) 


where K takes the values A, A+1,.... 

When the spin-axis interaction is included, there is a splitting of each 
term into 2S+1 terms in general (or 2K+1 if K < S), which differ in the 
value of the total angular momentumt J. According to the general rule for 


the addition of angular momenta, the number J takes (for a given K) values 
from K+S to |K—S|: 


|JK—S| <J < K+S. (84.3) 


To calculate the energy of the splitting (in the first approximation of per- 
turbation theory), we must determine the mean value of the operator of the 
spin-axis interaction energy for the state in the zero-order approximation 
(with respect to this interaction), In the case considered, this means averag- 
ing with respect to both the electron state and the rotation of the molecule 
(for a given r). The result of the first averaging is an operator of the form 


t In case b, the projection n.S of the spin on the axis of the molecule does not have 
definite values, so that there is no quantum number £ (or Q). 
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A(r)n . S, which is proportional to the projection n . S of the spin operator 
on the axis of the molecule. Next we average this operator with respect 
to the rotation of the molecule, taking the direction of the spin vector 
to be arbitrary; then n.§ = .§. The mean value ñ is a vector which, 
from considerations of symmetry, must have the same direction as the 
“vector” K, the only vector which characterizes the rotation of the molecule. 
Thus we can write 


fi = constant x K. 


The coefficient of proportionality is easily determined by multiplying both 
sides of this equation by K; noting that the eigenvalues of n . K and K? are 
respectively A (see (82.4)) and K(K +1), we find 


n.8 = AR .8/K(K+1). 


Finally, the eigenvalue of the product K . S, according to the general formula 
(31.3), is 


K.S =3(+1)—K(K+1)—S(S+1)]. (84.4) 


As a result, we arrive at the following expression for the required mean 
value of the energy of the spin—axis interaction: 


A(ALJ+1)—S(S+1)—K(K+1)]/2K(K+1) 
= AAJ- SJ +S+1)/2K(K+1)—}4(r)A. 


This exp:ession must be added to the energy (84.2). The term }A(r)A, being 
independent of K and J, can be included in U(r), so that we have finally for 
the effective potential energy the expression 


Url) = U(r) + B()K(K+1)+A()AU—S\J +S41)/2K(K+1). (84.5) 


An expansion in powers of E = r—r, gives, in the usual manner, an expres- 
sion for the energy levels of the molecule in case 6: 


E = Uetfiwev+})+BeK(K+1)+AcA(J—S)\Jt+S+1)/2K(K+1). (84.6) 


As has been pointed out in the previous section, the spin-orbit interaction 
for = terms does not give a multiplet splitting in the first approximation, and 
to determine the fine structure we must take into account the spin-spin 
interaction, whose operator is quadratic with respect to the spins of the elec- 
trons. We are at present interested not in this operator itself, but in the result 
of averaging it with respect to the electron state of the molecule, as was done 
for the operator of the spin-orbit interaction. It is evident from considera- 
tions of symmetry that the required averaged operator must be proportional 
to the squared projection of the total spin of the molecule on the axis, 1.e. 
it can be written in the form 


a(r) (8. n)?, (84.7) 
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where a(r) is again some function of the distance r, characterizing the given 
electron state. Symmetry allows also a term proportional to §%, but this is 
immaterial since the absolute value of the spin is just a constant. We shall 
not pause here to derive the lengthy general formula for the splitting due to 
the operator (84.7); in Problem 1 of this section we give the derivation of the 
formula for triplet È terms. 

The doublet È terms form a special case. According to Kramers’ theorem 
(§60), the double degeneracy in a system of particles with total spin S = 4 
certainly persists, even when the internal relativistic interactions in the system 
are fully allowed for. Hence the 22 terms remain unsplit, even when we 
take account of both the spin-orbit and the spin-spin interaction, and in any 
approximation. 

The splitting is obtained here only by taking into account the relativistic 
interaction of the spin with the rotation of the molecule; this effect is very 
small. The averaged operator of this interaction must evidently be of the 
form yK . S, and its eigenvalues are determined by the formula (84.4), in 
which we must put S = 3, J = K+}. Asa result, we obtain for the 25 
terms the formula 


E = U,+he,o+4)+B,K(K+1)44(K+}); (84.8) 


a constant — fy is included in U,. 


PROBLEMS 
PROBLEM 1. Determine the multiplet splitting of a °Z term in case b (H. A. Kramers 1929). 


SOLUTION. The required splitting is determined by the operator (84.7), which must be 
averaged with respect to the rotation of the molecule. We write it in the form aeninkSiSk, 
where &e = a&(ro). Since the vector S is conserved, only the products mne need be averaged. 
According to the formula derived in §29, Problem, we have 


—_ KiKi t+ KK: 
mnk = ee 
(2K—1\(2K +3) 


. 
se } 


here we have not written out the terms proportional to 8;; whose contribution to the energy 
is independent of J and therefore does not cause any splitting of the type under consideration. 
Thus the splitting is given by the operator 


_ ee ae + K,Ki) 
(2K —1)(2K+3) 


Since § commutes with K, 
SiStKiKy = $K, K, = (S . K}, 
where the eigenvalue S . K is given by (84.4). We also have 


S:S RK: = SiS KiKi iSiSperukKı 

(S. K) -H S: Se — SeS i)ieirkR, 
= (S | K) + }eirteirmSmK: 

= (S. KF +8. K. 
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The values J = K, K+ 1 correspond to the three components Ex of t' e triplet 3E (S = 1). 
For the intervals between these components we find 
A+] A 
Eka — Ex = >n Er- - Ex = —x,——. 
+3 2K—1 





PROBLEM 2. Determine the energy of a doublet term (with A #0) for cases intermediate 
between a and b (E. Hill and J. H. van Vleck 1928). 


SOLUTION. Since the rotational energy and the energy of the spin—axis interaction are 
supposed of the same order of magnitude, they must be considered together in perturbation 
theory, so that the perturbation operator is of the form t 


l = BR+An.§ 


As wave functions in the zero-order approximation it is convenient to use those of states in 
which the angular momenta K and J have definite values (i.e. those of case b). Since S = $ 
for a doublet term, the quantum number K, for a given J, can take the values K = J+}. 
To construct the secular equation, we must calculate the matrix elements (nSKJ|V|nSK‘J> 
(n denoting the assembly of quantum numbers defining the electron term), where K,K’ take 
the above values. The matrix of the operator K? is diagonal; the diagonal elements are 
K(K +1). The matrix elements of n.S are calculated from the general formula (109.5), in 
which we must put ji = S,j. = K; the reduced matrix elements of n are given by (87.4). 
Calculation gives the secular equation 


| BU FAM +2) — 4A I+ EM An +2?—A7V(2) +1) i 
lO AVIA] BAJ +2) —)+AAl(2J+1)-E™ 


Solving this equation and adding E“) to the unperturbed energy, we have 
E = Ustod +i) +B +) vB I+)" —-4,B A+j42]; 


a constant 3B¢ is included in Ue. The inequality A. ® Bej corresponds to case a, and the 
opposite one to case b. 


PROBLEM 3. Determine the intervals between the components of a triplet level *2 in a 
case intermediate between a and b. 


SOLUTION. As in Problem 2, the rotational energy and the energy of the spin-spin inter- 
action are considered together in the perturbation theory. The perturbation operator is of 
the form 


V = B,R2+0,n.§)*. 


As wave functions in the zero-order approximation we use those of case b. The matrix elements 
<K|n . S|K^ (we omit all suffixes with respect to which the matrix is diagonal) are again 
calculated from (109.5) and (87.4), this time with A = 0, S = 1. The non-zero elements are 
of the form 


in. SJ-1> = VIJ+1/2J4+)), Jin. SJ +> = VEI2I +0). 

For a given J, the number K can take the values K = J, J+1. For the matrix elements 
<K|V|K’ we find 

JPI) = BeJ(J+ lta J-1VIJ-1) = BAJ- DJ+ adJ + DJ+1), 

J+ UViJ+1> = BdaJ+1XJ+2)+aeJi(2J +1), 

J-AUV 41> = Jt UV -1> = ee + 1). 

t The averaging with respect to vibrations must be done before that with respect to 

rotation. Hence, restricting ourselves to the first terms of the expansions in £, we have 


replaced the functions B(r) and A(r) by the values Be and Ae, and the unperturbed energy 
levels are E™® = Uethwe(v+tt). 
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We see that there are no transitions between states with K = J and those with K = J+1. 
Hence one of the levels is simply E&E, = ¢J|V|J>. The other two (E, £3) are obtained by 
solving the quadratic secular equation formed from the matrix elements for transitions 
between states J+1. Since we are here interested only in the relative position of the com- 
ponents of the triplet, we subtract the constant xe from all three energies E,, Ez, E; Asa 
result we obtain 


E = Be J( J+), 
E23 = Bo(J?+J+1)—te. + V[Be(2J+1)?—*Be+}>,*]. 


In case b (a small), by considering three levels with the same K and different J (J = K, 
K+1), we again obtain the formulae of Problem 1. 


§85. Multiplet terms. Cases c and d 


Besides cases of a and b coupling and those intermediate between them, 
there are also other types of coupling. These originate as follows. The 
occurrence of the quantum number A is due ultimately to the electric interac- 
tion of the two atoms in the molecule, which results in the axial symmetry 
of the problem of determining the electron terms (this interaction in the 
molecule is called the coupling between the orbital angular momentum and 
the axis). The distances between terms with different values of A give a 
measure of the magnitude of this interaction. Previously we have tacitly 
supposed this interaction so strong that these distances are large both com- 
pared with the intervals in the multiplet splitting and compared with those 
in the rotational structure of the terms. There are, however, opposite cases 
where the interaction of the orbital angular momentum with the axis is com- 
parable with or even small compared with the other effects; in such cases, of 
course, we cannot in any approximation speak of a conservation of the pro- 
jection of the orbital angular momentum on the axis, so that the number A 
is no longer meaningful. 

If the coupling of the orbital angular momentum with the axis is small 
in comparison with the spin-orbit coupling, we say that we have case c. It 
is found in molecules which contain an atom of a rare-earth element. These 
atoms are characterized by the presence of f electrons with uncompensated 
angular momenta; their interaction with the axis of the molecule is weakened 
by the deep position of the f electrons in the atom. Cases intermediate be- 
tween the a and c types of coupling are found in molecules consisting of 
heavy atoms. 

If the coupling of the orbital angular momentum with the axis is small 
compared with the intervals in the rotational structure, we say that we have 
case d. This case is found for high rotational levels (with large J) in some elec- 
tron terms of the lightest molecules (He, He2). These terms are characterized 
by the presence in the molecule of a highly excited electron, whose interaction 
with the remaining electrons (or, as we say, with the “core” of the mole- 
cule) is so weak that its orbital angular momentum is not quantized along the 


axis of the molecule (whereas the “core” has a definite angular momen- 
tum Acore about the axis). 


a 
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As the distance r between the nuclei increases, the interaction between the 
atoms is diminished, and finally becomes small compared with the spin-orbit 
interaction within the atoms. Hence, if we consider the electron terms for 
fairly large r, we shall have case c. This must be borne in mind when 
ascertaining the relation between the electron terms of the molecule and the 
States of the atoms obtained as r + oo. In §80 we have already discussed this 
relation, neglecting the spin-orbit interaction. When the fine structure of 
the terms is included, there arises also the question of the relation between 
the values J, and /, of the total angular momenta of the isolated atoms and 
the values of the quantum number Q for the molecule. We shall give the 
results here, without reiterating arguments which are entirely similar to those 
of §80. 

If the molecule consists of different atoms, the possible values oft |Q! 
obtained on combining atoms with angular momenta J,, Jo}, > aaae 
given by the same table (80.1), in which we must put J}, /, in place of Re PS 
and |Q] in place of A. The only difference is that, for half-integral J, + Ja, 
the smallest value of [Q] is not zero as shown in the table, but 4. For integral 
Ji: +Jz, on the other hand, there are 2],+1 terms with Q = 0, for which (as 
for ~ terms when the fine structure is neglected) we have to decide the 
question of sign. If J, and J, are each half-integral, the number 2J,+1 is 
even, and there are equal numbers of terms, which we shall denote by 0+ 
and Q-. If J, and Ją are both integral, however, then /,+1 terms are 0+ and 
J2 are 0- (if ( — 1) + PiP2 = 1) or vice versa (if (~1)}} +PP: = —1). 

If the molecule consists of similar atoms in different states, the resulting 
molecular states are the same as in the case of different atoms, the only 
difference being that the total number of terms is doubled, with each term 
appearing once as an even and once as an odd term. 

Finally, if the molecule consists of similar atoms in the same state (with 
angular momenta J, = J, = J), the total number of states is the same as in 
the case of different atoms, while their distribution in parity is such that, 

if J is integral and Q is even, N, = N +1; 

if J is integral and Q is odd, N, = N,,; 

if J is half-integral and Q is even, N, = N,; 

if J is half-integral and Q is odd, N, = N,+1. 
All the 0+ terms are even and all the 0- terms odd. 

As the nuclei approach, a coupling of type c usually passes into one of 
type af. Here the following interesting circumstance may arise. As already 
mentioned, the term with A = 0 belongs to case b, and as regards the classi- 
fication of case a this means that multiplet levels with different values of Q 
(and the same A = 0) have the same energy; but such levels can occur on 
the approach of atoms which are in different fine-structure states. 


t In adding the two total angular momenta /,, Ja of the atoms to form the resultant angular 
momentum Q, the sign of Q is clearly immaterial. 
@ J The correspondence between the classification of terms of types a and ¢ cannot be 
derived in a general form. Its derivation necessitates a consideration of the actual potential 
energy curves, taking into account the rule that levels of like symmetry cannot intersect (§79). 
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Thus it may happen that the same molecular term corresponds to different 
pairs of atomic fine-structure states. A similar situation may occur for terms 
with Q = 0 which, on the approach of the nuclei, become a molecular term 
with A # 0 (and therefore & = — A). Such levels are doubly degenerate, 
since in case a the same energy corresponds to the terms 0T and 0- 
(which may arise from different pairs of atomic states).T 


§86. Symmetry of molecular terms 


In §78 we have already examined some symmetry properties of the terms 
of a diatomic molecule. These properties characterized the behaviour of the 
wave functions in transformations which leave the coordinates of the nuclei 
unaltered. Thus the symmetry of the molecule with respect to reflection in 
a plane passing through its axis brings about the difference between X+ and 
È- terms; the symmetry with respect to a change in sign of the coordinates} 
of all the electrons (for molecules composed of like atoms) gives rise to the 
classification of terms into even and odd. These symmetry properties char- 
acterize the electron terms, and are the same for all rotational levels belonging 
to the same electron term. 

The states of the molecule, like those of any system of particles (see §30), 
are characterized by their behaviour with respect to inversion, i.e. a simul- 
taneous change in sign of the coordinates of all the electrons and the nuclei. 
For this reason, all the terms for the molecule can be divided into positive 
(whose wave functions are unaltered when the sign of the coordinates of the 
electrons and nuclei is reversed) and negative (whose wave functions change 
sign on inversion). | 

For A # 0, each term is doubly degenerate, on account of the two possible 
directions of the angular momentum about the axis of the molecule. As a 
result of inversion, the angular momentum itself does not change sign, but 
the direction of the axis of the molecule is reversed (since the atoms change 
places), and hence the direction of the angular momentum A relative to the 
molecule is reversed. Hence two wave functions belonging to the same 
energy level are transformed into each other, and from them we can always 
form a linear combination that is invariant with respect to inversion and one 
that changes sign under this transformation. Thus we obtain for each term 
two states, of which one is positive and the other negative. In practice, 
every term with .\ # 0 is split, however (see §88), and so these two states 
correspond to different values of the energy. 

The £ terms require special consideration to determine their sign. First 





+t We here neglect what is called A-doubling (see §88). 
J The origin is supposed to be taken on the axis of the molecule, and half-way between the 
two nuclei 
We retain the customary terminology. It is unfortunate, however, since in the case of 


an atom the behaviour of the terms with respect to the operation of inversion is referred to as 
parity and not sign. 


The sign of which we are here speaking must not be confused with the + and — which are 
added as indices to È terms. 
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of all, it is clear that the spin bears no relation to the sign of the term: the 
inversion operation changes only the coordinates of the particles, leaving 
the spin part of the wave function unaltered. Hence all the components of 
the multiplet structure of any given term have the same sign. In other words, 
the sign of the term depends only on K, and not on J.+ 

The wave function of the molecule is the product of the electron and 
nuclear wave functions. It has been shown in §82 that, in a © state, the motion 
of the nuclei is equivalent to that of a single particle, of orbital angular 
momentum K, in a centrally symmetric field U(r). Hence we can say that, 
when the sign of the coordinates is changed, the nuclear wave function is 
multiplied by (—1)* (see (30.7)). 

The electron wave function characterizes the electron term, and to ascertain 
its behaviour under inversion we must consider it in a system of coordinates 
rigidly connected to the nuclei and rotating with them. Let x, y, z be a sys- 
tem of coordinates fixed in space, and £, 7, { a rotating system of coordinates 
in which the molecule is fixed. The direction of the axes of €, n, {is defined 
so that the -axis coincides with the axis of the molecule from (say) nucleus 
1 to nucleus 2, and the relative position of the positive directions of the axes 
of £, 7, ¢ is the same as in the system x, y, 2 (i.e. if the system x, y, 2 is left- 
handed, the system £, 7, Ý is so too). As a result of the inversion operation, 
the direction of the axes of x, y, z is reversed, and the system changes from 
left-handed to right-handed. The system £, 7, č must also become right- 
handed, but the ¢-axis, being rigidly connected to the nuclei, retains its 
former direction. Hence the direction of either one of the axes of £, 7 must 
be reversed. Thus the operation of inversion in the fixed system of co- 
ordinates is equivalent in the moving system to a reflection in a plane passing 
through the axis of the molecule. Under such a reflection, however, the 
electron wave function of a E+ term is unaltered, while that of a £- term 
changes sign. 

Thus the sign of the rotational components of a È+ term is determined by 
the factor (—1)¥; all the levels with even K are positive, while those with 
odd K are negative. For a X- term, the sign of the rotational levels is deter- 
mined by the factor (— 1)**+; all levels with even K are negative, while those 
with odd K are positive. 

If the molecule consists of similar atoms, its Hamiltonian is also invariant 
with respect to an interchange of the coordinates of the two nuclei. A term 
is said to be symmetric with respect to the nuclei if its wave function is un- 
altered when they are interchanged and antisymmetric if its wave function 
changes sign. The symmetry with respect to the nuclei is closely related to 
the parity and sign of the term. An interchange of the coordinates of the 
nuclei is equivalent to a change in sign of the coordinates of all the particles 
(electrons and nuclei), followed by a change in sign of the coordinates of the 
electrons only. Hence it follows that, if the term is even and positive (or 


# + We recall that case b usually holds for = terms, and so it is necessary to use the quantum 


numbers K and J. x 
t The two atoms must be not only of the same element, but also of the same isotope. 
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odd and negative), it is symmetric with respect to the nuclei. If, on the other 
hand, it is even and negative (or odd and positive), then it is antisymmetric 
with respect to the nuclei. 

At the end of §62 we have established a general theorem that the coordinate 
wave function or a system of two icenticai particies is symmetrical when the 
total spin of the system is even, and antisymmetrical when it is odd. If we 
apply this result to the two nuclei of a molecule composed of similar atoms, 
we find that the symmetry of a term is related to the parity of the total spin 
I obtained by adding the spins ¿ of the two nuclei. The term is symmetric 
when / is even, and antisymmetric when I is odd.t In particular, if the nuclei 
have no spin (z = 0), Z is zero also; hence the molecule has no antisymmetric 
terms. We see that the nuclear spin has an important indirect influence on 
the molecular terms, although its direct influence (the hyperfine structure of 
the terms) is quite unimportant. 

When the spin of the levels is taken into account, an aaditional degeneracy 
of the levels results. Again in §62, we have calculated the number of states 
with even and odd values of J that are obtained on adding two spins i. Thus, 
when ż is half-integral, the number of states with even J is (21+ 1), and with 
odd Z is (¢+1)(2i+1). From what was said above, we conclude that the 
ratio of the degrees £s, £a of the degeneracy{ of symmetric and antisymmetric 
terms for terms with half-integral č is 


Lika z t/(t+ 1). (86.1) 
For integral z, we similarly find that this ratio is 
Esl8a = (i+ 1)/i. (86.2) 


We have seen that the sign of the rotational components of a X+ term is 
determined by the number (—1)*. Hence, for example, the rotational com- 
ponents of a £+ term for even K are positive, and therefore symmetric, while 
for odd K they are negative and consequently antisymmetric. Bearing in 
mind the results obtained above, we conclude that the nuclear statistical 
weights of the rotational components of a È+, level with successive values 
of K take alternate values, in the ratios (86.1) or (86.2). A similar situa- 
tion is found for £+, X-, and X~, levels. In particular, for i = 0 the 
Statistical weights of levels with even K for £+, and -, terms, and of levels 
with odd K for £+ and =~, terms, are zero. In other words, in the electron 
States £+., E~, there are no rotational states with even K, and in By oy 
States there are none with odd K. 

Because of the extremely weak interaction of the nuclear spins with the 
electrons, the probability of a change in J is very small, even in collisions of 





t Recalling the relation between th 
when the total spin J of the nuclei is 
odd, and vice versa when I is odd. 

t The degree of degeneracy of a level is often refe 
weight. Formulae (86.1), ey determine the ratie 
metric and antisymmetric levels. 


€ parity, sign and symmetry of terms, we conclude that, 
even, the positive levels are even and the negative levels 


rred to in this connection as its statistical 
of the nuclear statistical weights of sym- 
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molecules. Hence molecules differing in the parity of J, and accordingly 
having only symmetric or only antisymmetric terms, behave almost.as differ- 
ent forms of matter. Such, for instance, are orthohydrogen and parahydro- 
gen; in the molecule of the former, the spins i = } of the two nuclei are 
parallel (J = 1), while in that of the latter they are antiparallel (I = 0). 


§87. Matrix elements for the diatomic molecule 


In this section we shall give some general formulae for the matrix elements 
of physical quantities in a diatomic molecule. Let us first consider the matrix 
elements for transitions between states with zero spin. 

Let A be some vector physical quantity for a molecule with fixed nuclei, 
such as its electric or magnetic dipole moment. Let us first consider this 
quantity in coordinates £, ņ, { that rotate with the molecule, the Z-axis being 
along the axis of the molecule. The angular momentum of the molecule 
relative to this system (i.e. the electron angular momentum L) is not con- 
served completely, but its -component is conserved. The selection rules 
for the quantum number L, = A therefore remain valid (and they are the 
same as for M in §29). Thus the non-zero matrix elements of the vector are 


<n'A|A\nA), <n'A|A,+iA,|n, A-1), 


} (87.1) 
a A= ASAA, 
where n numbers the electron terms for the given A. 

If both the terms are È terms, we must also bear in mind the selection 
rule arising from the symmetry with respect to reflection in a plane passing 
through the axis of the molecule. In sucha reflection, the {-component of an 
ordinary (polar) vector is unchanged, while that of an axial vector changes 
sign. Hence we conclude that, for a polar vector, A, has non-zero matrix 
elements only for the transitions E+ + E+ and £- + E-, and for an axial 
vector only for &+ > £-. We need not discuss the components A,, A, 
since for these no transitions without change of A are possible. 

If the molecule consists of similar atoms, there is also a selection rule 
regarding parity. The components of a polar vector change sign under 
inversion. Hence its matrix elements are non-zero only for transitions 
between states of different parity (the reverse is true for an axial vector). In 
particular, all the diagonal matrix elements of the components of a polar 
vector vanish identically. 

The question of the relation between the matrix elements (87.1) and those 
of the same vector in a fixed coordinate system x, y, z is solved by the general 
formulae derived in §110 below for any axially symmetric physical system. 

After separating the dependence (the same for any vector) on the quantum 
number Mx (the z-component of the total angular momentum K of the 
molecule), we are left with the reduced matrix elements <n'K'A'||A||nKA). 
Their relation to the matrix elements (87.1) is given by (110.7) with k = k’ =1 
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(corresponding to a vector) and the appropriate change in the quantum- 
number notation; from (82.4), .\ is equal to the {-component of the total 
angular momentum K. Using the relation (107.1) between the components 
of a spherical tensor of rank one and the Cartesian components of a vector, 
and the values of the 3j-symbols from Table 9 (§106), we obtain the following 
expressions for the matrix elements diagonal with respect to A: 














<n KAWAlInKA> = A Ese Cn’ Al A.\InA), 
K(K + 1) i 
(87.2) 
i . JK? — A2 
<n’, K—1, Al Ala KA) = ‘| = KEN AnA), 
and for those not diagonal with respect to A, 
<n' KA||AllnK, A—1) 
wae ee 1/2 
a, T O eS a) 2] (HN Ae Ae NN, 
4K(K +1) 
<n KA| Aln, K-1, A-1) 
2y 
2 k cael wna AAi (87.3) 


Cn’, K—1, AJAlaK, A—1) 


N kea 1/2 


We nh Ag--iAi ln, A—1). 


The remaining non-zero elements are found from these by means of the 
Hermitian property of the reduced matrix elements: 


(nK Al An K'AY = <n'K’'A'| AllnKA)*, 
and of the matrix elements in the coordinates é, 7m, 6; 


<nA|A,—-iA,|n'A') = (n''|A,+7A,|nA)*, 
(nA|A,|n’A’> = in’ A'|A-lnA>*. 


The following are the particular formulae for the matrix elements of the 
vector A = N, 4 unit vector along the axis of the molecule. In this case, 
we have simply A, = A, = 0, A, = 1, and in the coordinates L 7, € only 
the diagonal elements non-zero, <nA|A.jnA> = 1. The reduced matrix 
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elements are diagonal in all sufhxes except K, and if only this suffix is written 
we have 





7 2K +1 TEN _ „ [K2 AN? 
<KimlK) = A | EAT K= lE = ij =A (874) 


(H. Hönl and F. London 1925). For A = 0, these formulae give 


[Klia K> = 0, <K- Ila] K> = 14/K, 


which agree, as we should expect, with the matrix elements for a unit vector 
in motion in a centrally symmetric field; see (29.14). 

Let us now see how the formulae we have obtained should be modified 
for transitions between states with non-zero spin. Here it is important to 
know whether the states belong to case a or to case b. 

If both states belong to case a, the formulae are changed essentially only 
as regards notation. The quantum numbers K and Mk do not exist, and 
instead we have the total angular momentum J and its projection My on the 
z-axis. There are also the additional numbers S and Q = A+%, so that 
the reduced matrix elements are 


Cn f'S'QO'A'| AllnJSQAY. 


Let A be any orbital vector (i.e. one which does not depend on the spin). 
Its operator commutes with the spin operator §, so that its matrix is diagonal 
with respect to the quantum numbers S and S, = £; the quantum number 
Q = A+2Z therefore changes together with A (ie. Q-Q = A’—A). 
Formulae (87.2)-(87.4) are changed only in that the matrix elements have 
further suffixes, and in the other factors K and A are to be replaced by J and 
Q. For example, the first formula (87.2) becomes 





r 2J+1 ? 
QAJ Aln JQ = Q n'QA|A|2QA 
Kn'JOAl Alngany = Q AIEE ONA nary 


(the diagonal suffix S being omitted). À 
Now, let A = S. Since the spin operator commutes with the orbital 
angular momentum, and also with the Hamiltonian, its matrix is diagonal 
with respect to n and A, but not with respect to S and X (or Q). The matrix 
elements of the components A,, A,, A; for transitions S, 2 > S’, X’ are 
given by formulae (27.13), with S and È in place of L and M. The change 
to the coordinates x, y, zis then made by means of formulae (87.2) and (87.3), 
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with J and Q in place of K and A. Thus we have, for instance, 


Toi A ee 
JASIJ, 2-1) a | 


x gOS SaS} 


~ ee ee eT 
IJ +1) 


(the diagonal suffixes n, S and A being omitted). 

Next, let both states belong to case b, and let A be an orbital vector. The 
calculation of the matrix elements is performed in two stages. First we 
consider the rotating molecule without taking into account the addition of 
S and K; the matrix elements are diagonal with respect to the number S, 
and are determined by the same formulae (87.2), (87.3). In the second stage, 
the angular momentum K is added to S to give the total angular momentum 
J, and the new matrix elements are obtained by the general formulae (109.3), 
with K, S, J in place of jı, j2, J. For example, the elements diagonal with 
respect to J, K and A are first 


cn JKAAlnJKA) 
ay 
se 


and then, with the 6j7-symbol from Table 10 (§108) and the reduced matrix 
element from (87.2), finally 


= (—1)K+9+5+1(2J +1) (n'KA\ AlinKA) 


<n [KA AlnJ KA) 


==! 2j+1 Mod) + DARRI ese) Cn AJA InA> 
Fini 2K(K +1) a 


The calculation of the matrix elements for transitions between states of 
which one belongs to case a and the other to case b is carried out similarly. 
We shall not pause to discuss it here. 


PROBLEMS 


PROBLEM 1. Determine the Stark splitting of the terms for a diatomic molecule having 
a constant dipole moment, in the case where the term belongs to case a. 


f SorvuTion. The energy of a dipole d in an electric field G is —d.&. From considera- 
tions of symmetry, it is evident that the dipole moment of a diatomic molecule is directed 
along its axis; d = dn, where d is a constant. Taking the direction of the field as the z-axis 
we obtain the perturbation operator in the form —dn,€. A 
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Determining the diagonal matrix elements of mz in accordance with the formulae derived 
above, we find thar in case a the splitting of the levels is given by the formulat 


AE = -éd TJQ ](J + ry. 
PROBLEM 2. The same as Problem 1, but for the case where the term belongs to case 6 


(and A Æ 0). 
SOLUTION. By the same method we have 


AF- BPT el 1)—S(S+1)+A(K+1) 
2K(K + DJJ +1) 


PROBLEM 3. The same as Problem 2, but for a! Z term. 


SOLUTION. For A = 0 the linear effect is absent, and we must go to the second approxi- 
mation of perturbation theory. In the summation in the general formula (38.10), it is sufficient 
to retain only those terms which correspond to transitions between rotational components of 
the electron term concerned; for other terms the energy differences in the denominators are 
large. Thus we find 


KEM gin: K — l> Mg)? X KKM gln AK + l, Mrx)|?} 
Ex-Ex-1 Eg- Eks 


’ 


AE = wer) 


where Ex = BK(K +1). A simple calculation gives 


Pe i K(K +1)-3M x2 
B 2K(K4+1)2K-1)(2K +3) 


§88. A-doubling 


The double degeneracy of the terms with A # 0 (§78) is in fact only 
approximate. It occurs only so long as we neglect the effect of the rotation 
of the molecule on the electron state (and also the higher approximations with 
respect to the spin-orbit interaction), as we have done throughout the above 
theory. When the interaction between the electron state and the rotation is 
taken into account, a term with A # Ois split into two levels close together. 
This phenomenon is called A-doubling (E. Hill and J. H. van Vleck, and 
R. de L. Kronig, 1928). 

To consider this effect quantitatively, we again begin with the singlet 
terms (S = 0). We have calculated (in §82) the energy of the rotational 
levels in the first approximation of perturbation theory, determining the 
diagonal matrix elements (i.e. the mean value) of the operator 


Bir) (K—L)?*. 


To calculate the subsequent approximations, we must consider the elements 
of this operator that are not diagonal with respect to A. The operators K2 
and L? are diagonal with respect to A, so that we need consider only the oper- 
ator 2BK . L. 


+ It may seem that there is here a contradiction of the general assertion that there is no 
linear Stark effect (§76). In fact, of course, there is no contradiction, since the presence ofa 
linear Stark effect is here due to the double degeneracy of the levels with Q # 0; the formula 
obtained is therefore applicable provided that the energy of the Stark splitting is large com- 
pared with that of what is called the A-doubling (§88). 
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The calculation of the matrix elements of K.L is conveniently effected 
by means of the general formula (29.12), in which we must put A= K, 
B = L; the parts of L and M are taken by K and Mg, while in place of n we 
must put n, A, where n denotes the assembly of quantum numbers (other 
than A) which determine the electron term. Since the matrix of the vector 
K, which is conserved, is diagonal with respect to n, A, while that of the 
vector L contains non-diagonal elements only for transitions in which A 
changes by unity (cf. what was said in §87 concerning an arbitrary vector A), 
we find, using formulae (87.3), 


(WAKMx|K .Lin, A-1, KM gY 
= nAAL; + iL, n, A-1) V(K+4AK+1-A)]. (88.1) 


There are no non-zero matrix elements corresponding to any greater change 
in A. 

The perturbing effect of the matrix elements with A + A—1 can cause 
the appearance of an energy difference between states with +A only in the 
2Ath approximation of perturbation theory. Accordingly, the effect is pro- 
portional to B*4, i.e. to (m| M^, where M is the mass of the nuciei and m 
that of the electron. For A > 1, this quantity is so small that it is of no 
interest. Thus the A-doubling effect is of importance only for II terms 
(A = 1), which are considered below. 

For A = 1 we must go to the second approximation. The “orrections to 
the eigenvalues of the energy can be determined from the general formula 
(38.10). In the denominators of the terms in the sum occurring in this equa- 
tion we have energy differences, of the form Epag—Epr a-ig- In these 
differences, the terms containing K cancel, since, for a given distance r be- 
tween the nuclei, the rotational energy is the same quantity, B(r)K(K +1), 
for all the terms. Hence the dependence on K of the required splitting AE 
is entirely determined by the squared matrix elements in the numerators. 
Among these are the squared elements for transitions in which A changes 
from 1 to 0 and from 0 to —1; these both give, by (88.1), the same depen- 
dence on K, and we find that the splitting of the HI term is of the form 


åE = constant x K(K+ 1), (88.2) 


where the constant is of the order of magnitude of B?/«, e being the order of 
magnitude of the differences between neighbouring electron terms. 

Let us pass now to terms with non-zero spin (2II and 3J] terms; higher 
values of S are not found in practice). If the term belongs to case 6, the 
multiplet splitting has no effect on the A-doubling of the rotational levels, 
which is determined as before by formula (88.2). 

In case a, however, the effect of the spin is important. Here each electron 
term is characterized by the number Q as well as A. If we simply replace 
A by —A, then Q = A+ is changed, so that we obtain an entirely different 
term. The states with A, Q and —A, —Q are mutually degenerate. This 
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degeneracy can here be removed not only by the effect, considered above, of 
the interaction between the orbital angular momentum and the rotation of 
the molecule, but also by the effect of the spin-orbit interaction. The con- 
servation of the projection Q of the total angular momentum on the axis of 
the molecule is (if the nuclei are fixed) an exact conservation law, and so 
cannot be destroyed by the spin—orbit interaction; the latter can, however, 
change A and È (i.e. there are matrix elements for the corresponding transi- 
tions) in such a way that © remains unchanged. This effect, alone or in 
combination with the orbit—rotation interaction (which alters A but not £), 
may cause A-doubling. 

Let us first consider the °I terms. For the ?II,,. term (A=1, ÈE =—}, 
Q = 4), the splitting is obtained on taking into account simultaneously the 
spin-orbit and orbit—rotation interactions, each in the first approximation. 
For the former gives the transition A = 1, E = —4 > A = 0, © = j, and 
thea the latter converts the state A = 0, E = }intoA = —1, E = 4, which 
differs from the initial state by the signs of A and Q being reversed. The mat- 
rix elements of the spin-orbit interaction are independent of the rotational 
quantum number J, while the dependence of those for the orbit—rotation 
interaction is determined by formula (88.1), in which (under the radical) 
we must replace K and A by J and Q. Thus we have for the A-doubling 
of a *II,,. term the expression 


AE, 2. = constant x (J+), (88.3) 


where the constant is of the order of AB/«. For a ®II,,. term, on the other 
hand, the splitting can be found only in higher approximations, so that in 
practice Az, ..=0. 

Finally, let us consider ĉI{ terms. For a *II, term (A = 1, È = —1), the 
splitting is obtained on taking into account the spin-orbit interaction in the 
second approximation (because of the transitions A = 1, 2 = ~1 > A =0, 
2 = 0 > A = —1, 2=1). Accordingly, the A-doubling in this case is 
entirely independent of J: 


AE, = constant ~ A?Je, (88.4 


For a 311; term, & = 0, and so the spin has no effect on the splitting; hence 
we again have a formula like (88.2), but with K replaced by /: 


AF, = constant x /(7+1). (88.5) 


For a SII, term, higher approximations are needed, sc that we can suppose 
AE, = 0. i 
One of the levels of the doublet resulting from A-doubling is always posi- 
tive, and the other negative; we have already discussed this in §86. An 
investigation of the wave functions of the molecule enables us to establish 
the regularities of the alternation of positive and negative levels. Flere we 
shall give only the results of the investigation.t It is found that if, for some 


+ This may be found in E. Wigner and E. Witmer, Zeitschrift für Physik 51, 859, 1928. 
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value of J, the positive level is below the negative one, then in the doublet 
for J+1 the order is opposite, the positive level being above the negative 
one, and so on; the order varies alternately as the total angular momentum 
takes successive values. We are speaking here of case a terms; for case b, the 
same holds for successive values of the angular momentum K. 


PROBLEM 
Determine the A-splitting for a tå term. 


SoLtution. Here the effect appears in the fourth approximation of perturbation theory. 
Its dependence on K is determined by the products of the four matrix elements (88.1) for 
transitions vith change of A : 2 —> 1, 1 > 0, 0 — —1, —1 —> —2. This gives 

AE = constant x(K—1)K(K+1)(K+2), 


where the constant is of order of Bife? 


§89. The interaction of atoms at large distances 


Let us consider two atoms which are at a great distance from each other 
(relative to their size), and determine the energy of their interaction. In 
other words, we shall discuss the determination of the form of the electron 
terms when the distance between the nuclei is large. 

To solve this problem we apply perturbation theory, regarding the two 
isolated atoms as the unperturbed system, and the potential energy of their 
electrical interaction as the perturbation operator. As we know (see Fields, 
§§41, 42), the electrical interaction.of two systems of charges at a large distance 
r apart can be expanded in powers of 1,7, and successive terms of this expan- 
sion correspond to the interaction of the total charges, dipole moments, 
quadrupole moments, etc., of the two systems. For neutral atoms, the total 
charges are zero. The expansion here begins with the dipole-dipole inter- 
action (~1/r3); then follow the dipole-quadrupole terms (~1/r*), the 
quadrupole-quadrupole (and dipole-octupole) terms (~1/r°), and so on. 

Let us first suppose that both atoms are in the S state. Then it is easily 
seen that there is no interaction between the atoms in the first approximation 
of perturbation theory. The energy of the interaction of the atoms is there 
determined as the diagonal matrix element of the perturbation operator, 
calculated with respect to the unperturbed wave functions of the system 
(expressed in terms of products of the wave functions for the two atoms).f 
In S states, however, the diagonal matrix elements, 1.e. the mean values of 
the dipole, quadrupole, ete., moments, are zero; this follows immediately, 
since the distribution of charge density in the atoms is spherically symmetri- 
cal. Hence each term of the expansion of the perturbation operator in powers 
of 1/r is zero in the first approximation of perturbation theory. 





t Here we neglect the exchange effects, which decrease exponentially with increasing 
distance; cf. §62, Problem 1, and §81, Problem. 
gins. of course, does not imply that the mean value of the interaction energy of the atoms 
is precisely zero. It diminishes exponentially with distance, i.e. more rapidly than every 
finite power of lyr, and hence each term of the expansion vanishes. This occurs because the 
expansion of the interaction operator in terms of the multipole moments involves the assump- 
tion that the charges of the two atoms are at a large distance 7 apart, whereas 
mechanics the electron density distribution has finite (though expónentially S 
even at large distances., 


In quantum 
mall) values 
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In the second approximation it is sufficient to restrict ourselves to the dipole 
interaction in the perturbation operator, since this decreases least rapidly 
as 7 increases, 1.e. to the term 


Vy = [dy = d,—3(di e n)(dz s n)|/r3, (89.1) 


where n is a unit vector in the direction joining the two atoms. Since the 
non-diagonal matrix elements of the dipole moment are in general different 
from zero, we obtain in the second approximation of perturbation theory a 
non-vanishing result which, being quadratic in V, is proportional to 1/76. 
The correction in the second approximation to the lowest eigenvalue is 
always negative (§38). Hence we obtain for the interaction energy of atoms 
in their normal states an expression of the form 


U(r) = —constant/r®, (89.2) 


where the constant is positivet (F. London 1928). 

Thus two atoms in normal S states, at a great distance apart, attract each 
other with a force (—dU/dr) which is inversely proportional to the seventh 
power of the distance. The attractive forces between atoms at large distances 
are usually called van der Waals forces. These forces cause the appearance 
of minima on the potential energy curves of the electron terms even for 
atoms which do not form a stable molecule. These depressions, however, 
are very shallow (being only tenths or even hundredths of an electron-volt 
in depth) and lie at distances several times greater than the distances between 
atoms in stable molecules. 

If only one of the atoms is in the S state, the same result (89.2) is obtained 
for the interaction energy, since, for the first approximation to vanish, it is 
sufficient for the dipole (etc.) moment of only one atom to be zero. The 
constant in the numerator of (89.2) here depends, not only on the states of 
the two atoms, but also on their mutual orientation, i.e. on the value © of the 
projection of the angular momentum on the axis joining the atoms. 

If both atoms have non-zero orbital and total angular momenta, however, 
the situation is changed. The mean value of the dipole moment is zero in 
every state of the atom (§75). The mean values of the quadrupole moment in 
states with L # 0, J # Oor dare not zero, however. Hence the quadrupole- 
quadrupole term in the perturbation operator gives a non-zero result even 
in the first approximation, and the interaction energy of the atoms diminishes 
as the fifth, not the sixth, power of the distance: 


U(r) = constant/7°. (89.3) 
Here the constant may be either positive or negative, i.e. we may have either 
attraction or repulsion. As in the previous case, this constant depends not 


+ As examples, the value of the constant (in atomic units) for two atoms of hydrogen is 
6.5, of helium 1.5, of argon 68, of krypton 130. 
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only on the states of the atoms, but also on the state of the system formed 
by the two atoms. 

A special case is the interaction of two similar atoms in different states. 
‘The unperturbed system (the two isolated atoms) has here an additional de- 
generacy due to the possibility of interchanging the states of the atoms. 
Accordingly, the correction in the first approximation will be given by the 
secular equation, in which the non-diagonal matrix elements of the perturba- 
tion appear as well as the diagonal ones. If the states of the two atoms have 
different parities, and angular momenta L differing by +1 or 0 but not both 
zero (the same restriction being placed on J), then the non-diagonal matrix 
elements of the dipole moment for transitions between these states are in 
general not zero. Hence an effect in the first approximation is obtained from 
the dipole term in the perturbation operator. Thus the interaction energy 
of the atoms is here proportional to 1/r*: 


U(r) = constant/r3, (89.4) 


where the constant may have either sign. 

Usually, however, what is of interest is the interaction of the atoms averaged 
over all possible orientations of their angular momenta (this formulation 
corresponds, for instance, to the interaction of atoms in a gas). With this 
averaging, the mean values of all multipole moments vanish, and so do all 
effects, linear in these moments in the first approximation of perturbation 
theory, in the interaction of the atoms. Hence the averaged interaction 
forces between atoms at large distances always follow the law (89.2).t 

Let us further consider the kindred question of the interaction between an 
atom and an ion. In the first approximation of perturbation theory, this 
interaction is given by the mean value of the operator (76.8), the energy of the 
quadrupole in the Coulomb field of the ion. Since the potential of this field is 
ġ ~ l/r, the atom-ion interaction energy is proportional to l/r. This 
effect exists, however, only if the atom has a mean quadrupole moment. 
Even then, it vanishes on averaging over all directions of the angular momen- 
tum J. 

The next interaction in order of powers of 1/r, which is always non-zero, 
is that in the second order of perturbation theory with respect to the dipole 
operator (76.1). Since the, ion field strength is ~1/r?, the energy of this 
interaction is proportional to 1'r4+. It can be expressed in terms of the 
polarizability x of the atom (in the S state) by 


U = —a6e2/2r4, (89.5) 


If the atom is in its ground state, this energy (like all corrections to the 


t This law, derived on the basis of the non-relativistic theory, is valid only so long as the 
retardation of electromagnetic interactions is unimportant. For this to be so, the distance 7 
between the atoms must be small compared with €/won, where won are the frequencies of 
transitions between the ground state and the excited states of the atom. See ROT, §85, for the 
interaction of atoms when the retardation is taken into account. 
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ground-state energy) is negative, i.e. there is an attraction between the atom 
and the ion.t 


PROBLEM 


Derive a formula giving the van der Waals forces in terms of the matrix elements of dipole 
moments for two like atoms in S states. 


SOLUTION The answer ts obtained by applving the general formula (38.10) of perturbation 
theory to the operator (89.1). On account of the isotropy of the atoms in the S state it is 
evident a priori that, on summation over all intermediate states, the squared matrix elements 
of the three components of each of the vectors dı and de give equal contributions, while the 
terms which contain products of different components give zero. The result is 


Ty o O O 
76 nw" En + En. —2Ep 


where Er and En are the unperturbed values of the energies of the ground state and excited 
states of the atom. Since by hypothesis L = Q in the ground state, the matrix elements 


(dz)on are non-zero only for transitions to P states (L = 1). Using formulae (29.7), we bring 
U(r) to the final form 


2 nld 00>*%¢n'1\d['00>* 
Ur) = -42 Cn Ljia}00> 
3 ER Eni + En:1 — 2E00 
where in the suffixes nL of the energy levels and the reduced matrix elements the second 


suffix gives the value of L and the first represents the assembly of the remaining quantum 
numbers which determine the energy level. 


$90. Pre-dissociation 


A basic premise of the theory of diatomic molecules as given in this chapter 
is the assumption that the wave function of the molecule falls into the product 
of an electron wave function (depending on the distance between the nuclei 
as a parameter) and a wave function for the motion of the nuclei. This sup- 
position amounts to neglecting, in the exact Hamiltonian of the molecule, 
certain small terms corresponding to the interaction of the nuclear and 
electron motions. 

When these terms are taken into account and perturbation theory is applied, 
transitions between different electron states appear. Physically, the transi- 
tions between states of which at least one belongs to the continuous spectrum 
are of particular importance. 

Figure 30 shows curves for the potential energy of two electron terms 
(more precisely, the effective potential energy Uy in some given rotational 
states of the molecule). The energy E’ (the lower dashed line in Fig. 30) is 
the energy of some vibrational level of a stable molecule in the electron 
state 2. In state 1, this energy lies in the range of the continuous spectrum. 
In other words, in passing from state 2 to state 1 the molecule automatically 


+ A similar attraction occurs between an atom and an electron at large distances. This 
attraction is the reason for the ability of the atom to form a negative ion by attachment of an 
electron (with binding energy from a fraction of an electron-volt to several electron-volts). 
Not all atoms possess this property, however, since, in a field which decreases at large distances 
as 1/r* (or 1/7), the number of levels (corresponding to bound states of the electron) is 
always finite and, in particular cases, may be zero. 
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disintegrates; this phenomenon is called pre-dissociation.t As a result of 
pre-dissociation, the state of the discrete spectrum corresponding to curve 
2 has in reality a finite lifetime. This means that the discrete energy level 
is broadened, i.e. acquires a certain width (see the end of §44). 


U(r) 





a, 9, R 
Fic. 30 


If, on the other hand, the total energy E lies above the dissociation limit 
in both states (the upper dashed line in Fig. 30), the transition from one 
state to the other corresponds to what is called a collision of the second kind. 
Thus the transition 1 — 2 signifies the collision of two atoms, as a result of 
which the atoms are left in excited states, and separate with diminished 
kinetic energy (for r > œ, curve 1 passes below curve 2; the difference 
U,(co) — U,(00) is the excitation energy of the atoms). 

Because of the large masses of the nuclei their motion is quasi-classical. 
The problem of determining the probability of the transitions under considera- 
tion is therefore of the kind discussed in §52. From the general considera- 
tions given there we can say that the transition probability will be mainly 
determined by the point at which the transition could occur classically.f 
Since the total energy of the system of two atoms (the molecule) is conserved 
in the transition, the condition for it to be “classically possible” is that the 
effective potential energies, should be equal: Uy(r) = Ujo(r). On account 
of the conservation of the total angular momentum of the molecule also, the 
centrifugal energies are the same in the two states, and so this condition 
means that the potential energies are equal: 


Ui(r) = U(r), (90.1) 


the angular momentum not being involved at all. 


If equation (90.1) has no real roots in the classically accessible region 
(where E > Uji, Uyz), the transition probability according to §52 is expo- 


t Curve l may have no minimum at all if it corresponds to purely repulsive forces between 
the atoms. 


t Or else by the point r = O at which the potential energy becomes infinite. 
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nentially small.t Transitions occur with an appreciable probability only if 
the potential energy curves intersect in the classically accessible region 
(as shown in Fig. 30). Then the exponent in formula (52.1) is zero (and this 
formula is therefore, of course, invalid); accordingly, the transition probability 
is determined by a non-exponential expression which will be derived below. 
The condition (90.1) can then be interpreted as follows. If the potential (and 
total) energies are the same, so are the linear momenta. Hence the condition 
(90.1) may also be written in the form 


ri =f» Pi = Po (90.2) 


where p is the momentum of the relative radial motion of the nuclei, and the 
suffixes 1 and 2 refer to the two electron states. Thus we can say that the 
distance between the nuclei and their relative momentum remain unchanged 
at the instant when the transition occurs (this is called Franck and Condon’s 
principle). Physically, this 1s due to the fact that the electron velocities are 
large compared with those of the nuclei, and ‘“‘during an electron transition”’ 
the nuclei cannot noticeably change their position or velocity. 

It is not difficult to establish the selection rules for the transitions in ques- 
tion. First of all, there are two obvious exact rules. The total angular 
momentum J and the sign of the term (positive or negative; see §86) cannot 
change in atransition. This follows at once from the fact that the conserva- 
tion of the total angular momentum and of the behaviour of the wave function 
under inversion of the coordinate system are exact laws for any (closed) 
system of particles. 

Next, the rule which forbids (for molecules composed of similar atoms) 
transitions between states of unlike parity 1s very nearly accurate. For the 
parity of the state is uniquely determined by the nuclear spin and the sign of 
the term. The conservation of the sign of the term is an exact law, however, 
while the nuclear spin is very nearly conserved by virtue of the weakness of 
its interaction with the electrons. 

The requirement that there should be a point of intersection of the potential 
energy curves means that the terms must be of different symmetry (see §79). 
Let us consider transitions occurring in the first approximation of perturba- 
tion theory; the probability of transitions which occur only in higher approxi- 
mations is relatively small. First of all, we notice that the terms in the 
Hamiltonian which lead to the transitions in question are just those which 
cause the A-doubling of the levels. Among these terms are, firstly, terms 
representing the spin-orbit interaction. They are the product of two axial 
vectors, of which one is of spin character (i.e. is composed of the operators 
of the electron spins), and the other is of coordinate character; we emphasize, 


+ A peculiar situation must occur in the case of a transition involving a molecular term 
which can arise from two different pairs of atomic states (see the end of §85), i.e. when the 
potential energy curve is, as it were, split into two branches with increasing distance. In this 
case the transition probability 1s considerably greater; an example is given by A. I. Voronin 
and E. E. Nikitin, Optics and Spectroscopy 25, 450, 1969. 
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however, that these vectors are not simply the vectors § and L. Hence they 
have non-zero matrix elements for transitions in which S and A change by 
O, +1. The case where AS and AA are both zero (and A # 0) must be 
omitted, since the symmetry of the term would then be unchanged in the 
transition. The transition between two È terms is possible if one of them is 
a Èt term and the other a =~ term; an axial vector has non-zero matrix 
elements only for transitions between 2+ and 2X7 (see §87). 

The term in the Hamiltonian which corresponds to the interaction between 
the rotation of the molecule and its orbital angular momentum is proportional 
to J. L. Its matrix elements are non-zero for transitions with AA = +1 
without change of spin (only the -component of the vector, 1.e. L,, has ele- 
ments with AA = 0, but L, is diagonal with respect to the electron states). 

As well as the terms we have considered, there is also a perturbation due 
to the fact that the operator of the kinetic energy of the nuclei (1.e. the operator 
of differentiation with respect to the coordinates of the nuclei) acts, not only 
on the wave function of the nuclei, but also on the electron function, which 
depends on 7 as a parameter. The corresponding terms in the Hamiltonian 
are of the same symmetry as the unperturbed Hamiltonian. Hence they can 
lead only to transitions between electron terms of like symmetry, the prob- 
ability of which is negligible in view of the non-intersection of these terms. 

Let us go on to the actual calculation of the transition probability. For 
definiteness, we shal] consider a collision of the second kind. According to 
the general formula (43.1), the required probability is given by the expression 


2 


2r 
w = = E. ar dr 3 (90.3) 


where xaue = auc (nue being the wave function of the radial motion of the 
nuclei) and V(r) is the perturbing energy; we have taken, as the quantity vin 
(43.1), the energy E and integrated with respect to it. The final wave func- 
tion ¥puc2 Must be normalized by the delta function of energy. The quasi- 
classical function (47.5), thus normalized, 1s 





2 (1 . 


a, 


The normalizing factor is determined by the rule given at the end of §21. 
The wave function of the initial state can be written in the form 


2 f . ) 
ae cos F fa dr—=łr i (90.5) 





Xnuc.1 = 


Itis normalized so that the current density is unity in each of the two travelling 
waves into which the stationary wave (90.5) can be resolved; v, and v are 
the velocities of the relative radial motion of the nuclei. On substituting 
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these functions in (90.3), we obtain the dimensionless transition probability 
w. It can be regarded as the transition probability for the nuclei to pass 
twice the point 7 = 7, (the point of intersection of the levels). It must be 
borne in mind that the wave function (90.5) corresponds, in a certain sense, 
to a double passage through this point, since it contains both the incident and 
the reflected travelling waves. 

The matrix element of V(r), calculated with respect to the functions 
(90.4), (90.5), contains in the integrand a product of cosines, which can be 
written in terms of the cosines of the sum and difference of the arguments. 
On integrating near the point r = ro, only the second cosine is important, 


so that 
4 V(r) dr dr |? 
af cos| l len id Par Vowa) 


The integral rapidly converges as we move away from the point of intersec- 
tion. Hence we can expand the argument of the cosine in powers of £ = r —ro 
and integrate over € from —œ to +œ (replacing the slowly varying coeffi- 
cient of the cosine by its value at 7 = rọ). Bearing in mind that, at the point 
of intersection, p, = po, we find 


r r 
d 
fa dr— f pe dr xs (2-2 a 
dr, dro 
a, a, 


where Sọ is the value of the difference of the integrals at the point 7 = rẹ 
The derivative of the momentum can be expressed in terms of the force 
F = —dU/dr: differentiating the equation p,?/2u+ U, = p,?/2u + U, (where 
u is the reduced mass of the nuclei), we have v, dp,/dr —v, dp,/dr = F, —Fy. 
Thus 








~~ he 


1— F, 


F 
fa dr— | p dr œ% Sot 


G, a, 





f°, 


where v is the common value of v, and v, at the point of intersection. The 
integration is effected by means of the well-known formula 


[coset dé = i ; cos(«+47), 


™ & 


and as a result we have 


V2 
i —— cos” (+t). (90.6) 
~ Fol F,— F ay ee | 


The quantity S,/f is large and varies rapidly with the energy E. Hence, 
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on averaging over even a small interval of energy, the squared cosine can be 
replaced by its mean value. As a result we obtain the formula 


w = 4nV2holF,—F,| (90.7) 


(L. Landau 1932). All the quantities on the right-hand side of the equation 
are taken at the point of intersection of the potential-energy curves. 

In the application to pre-dissociation, we are interested in the probability 
of the disintegration of the molecule in unit time. In this time, the nuclei 
in their vibrations pass 2(w/27) times through the point r = ro. Hence the 
required pre-dissociation probability is obtained by multiplying w (the 
probability for a double passage) by w/27, i.e. it is 


2V20/hv| F,—F, |. (90.8) 


The following remark must be made concerning these calculations. In 
speaking of the intersection of terms, we have had in mind the eigenvalues 
of the “unperturbed” Hamiltonian A, of the electron motion in the molecule; 
in this, the terms V which lead to the transitions concerned are not taken into 
account. If we include these terms in the Hamiltonian, the intersection of 
the terms becomes impossible, and the curves move apart slightly, as shown 
in Fig. 31. This follows from the results of §79 when regarded from a slightly 
different point of view. 


& 





Fic. 31 


Let U;,(r) and U,,(r) be two eigenvalues of the operator He (in which 7 
is regarded as a parameter). In the region near the point 7, where the curves 
U (r) and U,,(r) intersect, to determine the eigenvalues U(r) of the perturbed 


operator foty we must use the method given in §79, as a result of which 
we obtain the formula 


Ub alr) = (Ugat Oy24+ Vir t+ Foo) V[k(Usr — Use t+ yy — V22)? + Vy 92?], 
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where all quantities are functions of 7; the function U(r) (with the upper 
sign in the formula) corresponds to the upper continuous curve (1° — 2) in 
Fig. 31, and U(r) to the lower one (2'— 1). The matrix elements V11 and 
V22 may be included in the definition of the functions Uy; and Use respec- 
tively; V12 will be denoted by V(r) simply. Then the above formula becomes 


Uv,alr) = 3,0 n+ Use) + eV [(Oa1 — O52)? +472). (90.9) 
The interval between the two levels is now 


AU & s/[(Un1— Uara. (90.10) 


Thus, if there are transitions between the two states (V # 0), the intersection 
of the levels disappears. The minimum distance between the curves occurs 
atr = ro, where Uj, = Uje: 


(AU) min = 2|V(ro)]- (90.11) 


Near this point, we can expand the difference Us, — Use in powers of the 
small difference é = r—70, putting 


Uy, — Ugg = Ui- Ug x &(F2— Fi), 
where F = —(dV/dr),... Then 
AU = V/[(Fe—Fi)* E€ +4V2(ro)]. (90.12) 


For the validity of formulae (90.11) and (90.12), which have been derived 
by consideration of only two states, it is necessary that (AU), should be 
small ın comparison with the distance of the other terms. For the validity 
of (90.7) as the transition probability we must satisfy the condition (90.19) 
below, which in general is a more stringent one. If the latter condition is not 
satisfied, it ıs still permissible to consider only two terms, but ordinary 
perturbation theory is not available for the calculation of the transition 
probability. In that case, a more general treatment is needed. 

If we consider only die neighbourhood of the point of intersection and 
treat the motion of the nuclei quasi-classically, we can replace the velocity 
operator of the nuclei in the Hamiltonian of the system by a constant v, and 
the coordinate r by a function of time satisfying the classical equation dr/dt = 
v, i.e. E = r—ro = vt. The problem of calculating the transition probability 
then reduces to that of solving the wave equation for the electron wave 
functions with a Hamiltonian explicitly dependent on the time: 


thd lêt = [Ao(t) + PANE. (90.13) 


Let Ya and yp be the wave functions of the electron states corresponding to 


§90 Pre-dissociation 351 
the curves a and 6. They are solutions of the equations 


(Ho+V)Ya,0 = Ua,o(t)ba,0, 


in which ¢ is a parameter. The solution of equation (90.13) is sought in the 
form 


F = a(t)bat o(t)po. (90.14) 


If the equation is solved with the boundary condition a = 1, b = 0 as 
t > — œ, then |b(co)|? gives the probability that the molecule enters the 
state Wp, representing a transition from curve a to curve b, as the nuclei 
pass through the point r = rọ. Similarly, |a(co)|? = 1—|b(co)|? is the 
probability that the molecule remains on the curve a. A transition from 
curve a to curve b in a twofold passage through the point ro (as the nuclei 
approach and then recede) can occur in two ways: a >b > 6 (with the 
transition 1 —> 1’ as they approach and the molecule remaining on the curve 
1'2 as they recede), or a >a —> b (with 1 > 2’ as they approach and 2’ ->2 
as they recede). Hence the required probability for such a transition is 


w = 2|B( c0)|2[1 —|b( co) 2], (90.15) 


where we have used the fact that the transition probability in a passage 
through the point r = ro is of course independent of the direction of motion. 

The value of b( æ) can be found by the method described in §53, without 
making direct use of (90.13).f To do so, we note that the curves of Ua(t) and 
U(t) intersect at the imaginary points 


to = +1—————- = +779. (90.16) 


For large negative ż, the coefficient a(t) in (90.14) has a form that is ‘‘quasi- 
classical with respect to time”: 


- t 


a(t) = exp ix] Ua(t) ar, 


Let us now move from the left half of the real axis in the plane of the complex 
variable ¢ to the right half along a contour on which the ‘“‘quasi-classicality” 


t In 353 the process was assumed to be entirely adiabatic, and accordingly its probability 
was found to be exponentially small. In the present case, however, this condition may be 
violated when the nuclei are in the immediate neighbourhood of the point ro, if their velocity 
v is not sufficiently small. However, it is clear from the analysis in §§52 and 53 that only the 
adtabaticity for large |t], and the possibility of considering just two levels in the system, are 
important as regards the applicability of the method itself. 
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condition is always satisfied; since Ua < Ub», the path used must be in the 
upper half-plane, passing round the point tof (cf. §53). The function a(t) 
then becomes b(ż), with 


iTo ti 


exp a l l Uaļlt) dt + | C(t) ar]| 


IT, 


|b oc)? 


1 


iTo 


9 
= exp ‘zim | AU ar), 


t 


where tı may be taken to be any point on the real axis, for instance tı = 0. 
According to (90.12) we have 


AU = yie Rear, (90.17) 


and the required integral, with the substitution £ = îr, becomes 


i [ver (F2—Fy)2v272] dr = i in 
i ee ye 1 
v|Fo—Fy| 
0 


Thus we have the following final expression for the transition probabil:ty: 


2 2 
w = 2 exp e [D-o SLAR] (90.18) 
hol Fe — Fl hz| Fa — Fil 


(C. Zener 1932). We see that the transition probability becomes small ın two 
limiting cases. For V2 > Řo|F2— F| it is exponentially small (the adiabatic 
case), and for 


V2 < fiv|Fo—F\| (90.19) 


formula (90.18) becomes (90.7). From (90.17) we see that 7 ~ |V’|/|F2— Fy{v 
is the “passage time” for the nuclei through the point of intersection; the 
corresponding frequency w, ~ 1/7. Hence the attainment of the two limiting 
cases mentioned is determined by the relation between fiw, and the charac- 
teristic energy |V| of the problem. 
Finally, let us consider the phenomenon, akin to pre-dissociation, of what 
are called perturbations in the spectra of diatomic molecules. If two discrete 
@ molecular levels E, and E, corresponding to two intersecting electron terms 
are close together, the possibility of a transition between the two electron 
states results in a displacement of the levels. According to the general 
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formula (79.4) of perturbation theory, we have for the displaced levels the 
expression 


HE, + E)E VUE. E) +] Venul) (90.20) 


where Vi? auc 1s the matrix element of the perturbation for the transi- 
tion between the molecular states 1 and 2; the matrix elements Vii nus and 
Vao nu Must, of course, be included in £; and Æ. From this formula we see 
that the two levels are moved apart, being displaced in opposite directions 
(the higher level is raised and the other lowered). The amount of the dis- 
placement is the greater, the smaller the difference |E, —£,]. 

The matrix element Vj. nuc is calculated in exactly the same way as for 
determining the probability of a collision of the second kind. The only 
difference is that the wave functions x,,., and xaue belong to the discrete 
spectrum, and hence must' be normalized to unity. According to (48.3) we 


have 
r 
2% (i | 
rower = = 008) = | pı drl, 
TV Ai 


and similarly for x,,¢2. A comparison with formulae (90.3) to (90.5) 
shows that the matrix element Vy9,_,,, here considered is related to the 
transition probability w for a twofold passage through the point of inter- 
section by 


[Vianu] = @(Aw,/27)(hw,/27). (90.21) 


PROBLEMS 


PROBLEM 1. Determine the total cross-section for collisions of the second kind, as a 
function of the kinetic energy E of the colliding atoms, for transitions pertaining to the spin— 
orbit interaction (L. D. Landau 1932). 


SOLUTION. On account of the quasi-classical motion of the nuclei, we can introduce the 
concept of the impact parameter p (the distance at which the nuclei would pass if there were 
no interaction between them) and define the cross-section dg as the product of the “target 
area" 27p dp and the transition probability w(p) per collision (cf. Mechanics, §18). The total 
cross-section g is obtained by integrating with respect to p. 

For spin-orbit interaction, the matrix element V(r) is independent of the angular momen- 


tum M of the colliding atoms. We write the velocity v at the point r = ro, where the curves 
Intersect, in the form 


v = Vi(2u(E—U—M?2ur.2)} = V[(2/p(E-U—p*E}r2)), 


Here U is the common value of U, and U, at the point of intersection, p 1s the reduced mass 
of the atoms, and the angular momentum M = ppvæ, where tæ is the relative velocity of the 
atoms at infinity. The zero of energy is chosen so that the interaction energy of the atoms tn 


the "he state is zero at infinity; then E = $yv,,2. Substituting this expression in (90.7), 
we nn 


87? V? pdp 


do = 2npdp.t = $$ ee 
AFL—F il V(2(E— Uap? /r,2)/p] 
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The integration with respect to p must be taken from zero up to the value for which the 
velocity v vanishes. As a result we have 


44/ (2p)? Vr (E—U) 
AFL, E 


PROBLEM 2. The same as Problem 1, but for transitions pertaining to the interaction be- 
tween the rotation of the molecule and its orbital angular momentum (L. D. Landau 1932). 


SoLuTION. ‘The matrix element V is of the form V(r) = MDj/pr?, where D(r) is the matrix 
element of the electron orbital angular momentum. By the same method as in Problem 1 we 
obtain 


16./2n2D? (E-U: 
o — ————— HA _ - - 
3hval FF] E 


PROBLEM 3. Determine the transition probability for energies E close to the value Uy 
of the potential energy at the point of intersection. 


SOLUTION. For small! values of E—Uy, formula (90.7) is inapplicable, since the velocity 
v of the nuclei cannot be regarded as constant near the point of intersection, and hence it 
cannot be taken outside the integral as it was in deriving (90.7). 

Near the point of intersection we replace the curves of Uy,, Uy, by the straight lines 


Uz = Uy-Fing, Ug: = Uz—-Frpé, Eé=r-ro. 


The wave functions xnnc,1 and Xaveg in this region are wave functions of one-dimensional 
motion in a homogeneous field (§24). The calculations are conveniently effected by means 
of wave functions in the momentum representation. The wave function normalized by the 
delta function of energy is of the form (see §24, Problem) 


ie | I 
= ———__— exp: ——(E—U)) p—p5/6n1) -, 
Jaian igr E Ue POH; 


ay 


while the wave function normalized to unt current density in the incident and reflected waves 
is obtained by multiplying by 4/(27A): 


i l t i 
= /-——|(E— U;)p— p3/6p] -. 
V|Fa| is rae oF J j "h 





ay 


On integrating, the perturbing energy (matrix element) V may again be taken outside the 
integral, replacing it by its value at the point of intersection; 


[9] 
=a ed i? 
w = F Jan P| ; 


As a result we obtain 


4nV2(2yu)*? 2880291 Iyo 
w = —— | -- un( =) (z) | 
ht BFF 32) RFI — Fyyr he Fj Fy, 


wnere (£) is the Airy function (see §b of the Mathematical Appendices). For large E— Uy, 
this formula reduces to (90.7). 


PROBLEM 4. Determine the probability of charge exchange in a distant slow (relative 
velocity v<1) collision between a hydrogen atom and a hydrogen ion (proton) (O. B. Firsov 
1951).T 


SoLuTIon. We shall regard the system H+H* as a molecular hydrogen 10n (see §81, 
® Problem). Charge exchange consists in the transfer of the electron from a state wb, localized 


+ In this problem, atornic units are used. 
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at the first nucleus to a state ý, near the second nucleus. These are not stationary states, even 
when the nuclei are at rest. The stationary states are 


You = write + ye). 


Their energies are Up,u(R) as functions of the distance R between the nuclei. When the 
nuclei are in a given slow motion (which we regard as classical), these energies are slowly 
varying functions of the time, and the time dependence of the wave functions is given by the 
factors ‘“quasi-classical with respect to time” 


exp (~1 f Cy u(t) dt); 


cf. §53. The superposition of the two states that is equal to #, when t = —© is 
t t 
Y= wie eS ge U, dt) + yu ap wil Uu ar) 


When t >œ, this function ıs a linear combination, of the form c, $, + C22, and the probability 
of charge exchange is w = |c,{*. A simple calculation gives 


te) 
w=sinty, y= ! f (Uu-U,)de. 


= 90 


In a collision with large impact parameters p (which are important at a sufficiently low 
velocity v), the motion of the nuclei may be assumed to take place -in a straight line, with R = 
V(p'+v7r). The difference U,—U, for R>1 is given by formula (4) in §81, Problem. Then 


For p>1, the important range of values of R in the integral is that near the lower limit; 
putting R = p(1 +x), we obtain 





= EVEN PY Ie”, 
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CHAPTER -AN 


FHE THEORI- OF SYMMETRI 


§91. Symmetry transformations 


Tue classification of terms in the polyatomic molecule is fundamentally 
related to its symmetry, as in the diatomic molecule. Hence we shall begin 
by examining the types of symmetry which a molecule can have. 

The symmetry of a body is deternuned by the assembly of all those re- 
arrangements after which the body is unaltered; these rearrangements are 
called symmetry transformations. Any possible symmetry transformation can 
be represented as a combination of one or more of the three fundamental 
types of transformation. These three essentially different types are: the 
rotation of the body through a definite angle about some axis, the reflection 
of it in some plane, and the parallel displacement of the body over some 
distance. Of these, the last evidently is applicable only to an infinite medium 
(a crystal lattice). A body of finite dimensions (in particular, a molecule) 
can be symmetrical only with respect to rotations and reflections. 

If the body is unaltered on rotation through an angle 27/n about some 
axis, then that axis is said to be an axis of symmetry of the nth order. The 
number n can take any integral value: 1 = 2,3,.... The value n = 1 
corresponds to a rotation through an angle of 27 or, what is the same thing, 
of 0, i.e. it corresponds to an identical transformation. We shall symbolically 
denote by C,, the operation of rotation through an angle 27/n about a given 
axis. Repeating this operation two, three, ... times, we obtain rotations 
through angles 2(2z/n), 3(2z/n), ..., which also leave the body unaltered; 
these rotations may be denoted by C,?, C,3,.... It is obvious that, if p 
divides 7, 


CaP = C5 (91.1) 


In particular, performing the rotation n times, we return to the initial position, 
i.e. we effect an identical transformation. The latter is customarily denoted 
by E, so that we can write 


eee. (91.2) 


If the body is left unaltered by a reflection in some plane, this plane is said 
to be a plane of symmetry. We shall denote by the symbol o the operation 
_of reflection in a plane. It is evident that a double reflection in the same 
plane is the identical transformation : 
fener ae (91.3) 
2356 
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A simultaneous application of the two transformations (rotation and 
reflection) gives what are called the rotary-reflection axes. A body has a 
rotary—reflection axis of the nth order if it is.left unaltered by a rotation 
through an angle 27/n about this axis, followed by a reflection in a plane 
perpendicular to the axis (Fig. 32). It is easy to see that this is a new form 





Fic. 32 


of symmetry only when n is even. For, if 7 is odd, an n-fold repetition of the 
rotary—reflection transformation would be equivalent to a simple reflection 
in a plane perpendicular to the axis (since the angle of rotation is 27, while 
an odd number of reflections in the same plane amounts to a simple reflection). 
Repeating this transformation a further n times, we have as a result that the 
rotary—reflection axis reduces to the simultaneous presence of an axis of 
symmetry of the mth order and an independent plane of symmetry perpen- 
dicular to this axis. If, however, n is even, an n-fold repetition of the rotary- 
reflection transformation returns the body to its initial position. 

We denote the rotary—reflection transformation by the symbol Sy. 
Denoting by o, a reflection in a plane perpendicular to a given axis, we can 
put, by definition, 

CBee: Soe: (91.4) 


the order in which the operations C,, and o, are performed clearly does not 
affect the result. 

An important particular case ‘is a rotary—reflection axis of the second 
order. It is easy to see that a rotation through an angle ~, followed by a 
reflection in a plane perpendicular to the axis of rotation, is the inversion 
transformation, whereby a point P of the body is carried into another point 
P’, lying on the continuation of the line which joins P to the intersection O 
of the axis and the plane, and such that the distances OP and OP’ are the same. 
A body symmetrical with respect to this transformation is said to have a 


centre of symmetry. We shall denote the operation of inversion by J, so that 
we have 


I= S,='C,c),. (91.5) 


It is also evident that Io, = C,, IC, = an; in other words, an axis of the 
second order, a plane of symmetry perpendicular to it and a centre of sym- 
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metry at their point of intersection are mutually dependent: if any two of 
these elements are present, the third is automatically present also. 

We shall now give var'ous purely geometrical properties of rotations and 
reflections which it is useful to bear in mind in studying the symmetry of 
bodies. 

A product of two rotations about axes intersecting at some point is a 
rotation about some third axis also passing through that point. A product 
of two reflections in intersecting planes is equivalent to a rotation; the axis 
of this rotation is evidently the line of intersection of the planes, while 
the angle of rotation is easily seen, by a simple geometrical construction, to 
be twice the angle between the two planes. If we denote a rotation through 
an angled about an axis by C(¢), and reflections in two planes passing through 
that axis by the symbols} c,„and o’,, the above statement can be written as 


Oyo, = C(2¢), (91.6) 


where ¢ is the angle between the two planes. It must be noted that the order 
in which the two reflections are performed is not immaterial. The trans- 
formation ¢,o’, gives a rotation in the direction from the plane of o’, to 
that of o,; on interchanging the factors we have a rotation in the oppo- 
site direction. Multiplying equation (91.6) on the left by c}, we obtain 


Gis = CO: : (91.7) 


in other words, the operation of rotation, followed by refiection in a plane 
passing through the axis, is equivalent to a reflection in another plane 
intersecting the first at half the angle of rotation. In particular, it follows 
from this that an axis of symmetry of the second order and two mutually 
perpendicular planes of symmetry passing through it are mutually dependent; 
if two of them are present, so is the third. 

We shall show that the product of rotations through an angle 7 about two 
axes intersecting at an angle ¢ (Oa and Ob in Fig. 33) is a rotation through 
an angle 2¢ about an axis perpendicular to the first two (PP’ in Fig. 33). 


P 





P’ 
Fic. 33 


+ The suffix v customarily denotes a reflection in a plane passing through a given axis (a 
“vertical”? plane}, and the sufhx h a reflection in a plane perpendicular to the axis (a 


“horizontal” plane). 
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For it is obvious that the resulting transformation is also a rotation; after 
the first rotation (about Oa) the point P is carried into P’, and after the second 
(about Ob) it returns to its original position. This means that the line PP’ 
remains fixed, and is therefore an axis of rotation. To determine the angle 
of rotation, it is sufficient to note that, in the first rotation, the axis Oa 
remains fixed, while after the second it takes the position Oa’, which makes 
an angle 2¢ with Oa. In the same way we can see that, when the order of the 
two transformations is reversed, we obtain a rotation in the opposite direction. 
Although the result of two successive transformations in general depends 
on the order in which they are performed, in some cases the order of opera- 
tions is immaterial: the transformations commute. This is so for the following 
transformations: 
(1) Two rotations about the same axis. 
(2) Two reflections in mutually perpendicular planes (equivalent to a 
rotation through ~v about their line of intersection). 
(3) Two rotations through z about mutually perpendicular axes (equivalent 
toʻa rotation through ~v about the third perpendicular axis). 
(4) A rotation and a reflection in 2 plane perpendicular to the axis of 
rotation. 
(5) Any rotation or reflection and an inversion with respect to a point 
lying on the axis of rotation or in the plane of reflection; this follows 
from (1) and (4). 


§92. Transformation groups 


The set of all the symmetry transformations for a given body is called its 
symmetry transformation group (or simply its symmetry group). Hitherto we 
have spoken of these transformations as geometrical rearrangements of the 
body. However, in quantum-mechanical applications it is more convenient 
to regard symmetry transformations as transformations of the coordinates 
which leave the Hamiltonian of the system in question invariant. It is obvious 
that, if the system is left unaltered by some rotation or reflection, the cor- 
responding transformation of the coordinates does not change its Schrédin- 
Ber’s equation. Thus we shall speak of a transformation group with respect to 
which a given Schrédinger’s equation is invariant. 





t This point of view enables us to include in our considerations not only the rotation and 
reflection groups discussed here, but also other types of transformation which leave Schrédin- 
ger’s equation unaltered. These include the interchange of the coordinates of identical 
particles forming part of the system considered (a molecule or atom). The set of all possible 
permutations of identical particles in a given system ts called its permutation group (we have 
already met these permutations in §63). The general properties of groups given below apply 
to permutation groups also; we shall not pause to study this type of group in more detail here. 

The following remark should be made concerning the notation which we use in this 
chapter. Symmetry transformations are essentially operators just like those which we con- 
sider all through the book. They ought, therefore, to be denoted by letters with circurmflexes. 
We do not do this, in view of the generally accepted notation, and because this omission 
cannot lead to misunderstandings in the present chapter. For the same reason we denote the 
identical transformation by the customary symbol Æ, and not by 1, which would correspond 
to the notation in the other chapters. Lastly, the inversion operator is denoted in this chapter 


by J, instead of P as in §30, although the latter is customary in recent literature on quantum 
mechanics. 
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Symmetry groups are conveniently studied with the help of the general 
mathematical techniques of what is called group theory, the fundamentals of 
which we shall explain below. At first we shall consider groups, each of which 
contains a finite number of transformations (known as finite groups). Each of 
the transformations forming a group is said to be an element of the group. 

Symmetry groups have the following important properties. Each group 
contains the identical transformation E (called the unit element of the group). 
The elements of a group can be multiplied by one another; by the product 
of two (or more) transformations we mean the result of applying them in 
succession. It is obvious that the product of any two elements of a group is 
also an element of that group. For the multiplication of elements we have 
the associative law (AB)C = A(BC), where A, B, C are elements of a group. 
There is evidently no general commutative law; in general, AB 4 BA. 
For each element 4 of a group there is in the same group an inverse element 
A~ (the inverse transformation), such that AA-!= E. In some cases an 
element may be its own inverse; in particular, E~! = E. It is evident that 
mutually inverse elements A and A~! commute. 

The element inverse to the product AB of two elements is 


(ABY = BA, 


and similarly for the product of a greater number of elements; this is easily 
seen by effecting the multiplication and using the associative law. 

If all the elements of a group commute, the group is said to be Abelian. 
A particular case of Abelian groups is formed by what are called cyclic groups. 
By a cyclic group we mean a group, all of whose elements can be obtained by 
raising one of them to successive powers, 1.e. a group consisting of the 
elements 

Ae eA” =F, 


where 7% is some integer. 

Let G be some group.t If we can separate from it some set of elements 
H such that the latter is itself a group, then the group H is called a sub-group 
of the group G. A given element of a group may appear in several of its 
sub-groups. i 

By taking any element A of a group and raising it to successive powers, 
we finally obtain the unit element (since the total number of elements in the 
group is finite). If n is the smallest number for which A" = E, then 7 is 
called the order of the element A, and the set of elements A, 4?,..., ÆA” = E 
is called the period of A. The period is denoted by {4}; it is itself a group, i.e. 
it is a sub-group of the original group, and is cyclic. 

In order to find whether a given set of elements of a group is a sub-group 
of it, it is sufficient to find whether, on multiplying any two of its elements, 
we obtain another element of the set. For in that case we have, together with 
& each element A, all its powers, including A”-! (where n is the order of A), 


+ We shall denote groups by bold italic letters. 
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which is the inverse of A (since A”-! A = A" = E); and there will obviously 
be a unit element. 

The total number of elements in a group is called its order. It is easy 
to see that the order of a sub-group is a factor of the order of the whole group. 
To show this, let us consider a sub-group H of a group G, and let G, be 
some element of G which does not belong to H. Multiplying all the elements 
of H (on the right, say) by G,, we obtain a set (or complex, as it is called) 
of elements, denoted by HG. All the elements of this complex clearly belong 
to the group G. However, none of them belongs to H; for, if for any two 
elements Ha, H, belonging to H we had H,G, = Hy, it would follow that 
Gi = H,1H,, 1.e. G, would also belong to the sub-group H, which is 
contrary to hypothesis. Similarly we can show that, if G, is an element of G 
not belonging to H or to HG,, none of the elements of the complex HG, 
will belong to H or to HG,. Continuing this process, we finally exhaust 
all the elements contained in the finite group G. Thus all the elements are 
divided among the complexes (called the cosets of H in G) 


H, HG,, HG., ..., HG, 


each of which contains h elements, h being the order of the sub-group H. 
Hence it follows that the order g of the group G is g = hm, and this proves 
the theorem. The integer m = g/h is called the index of the sub-group H 
in the group G. 

If the order of a group is a prime number, it follows at once from the 
above that the group has no sub-groups (except itself and £). The converse 
theorem is also valid: a group having no sub-groups is of prime order and in 
addition must be cyclic (since otherwise it would contain elements whose 
period would form a sub-group). 

We shall now introduce the important concept of conjugate elements. Two 
elements A and B are said to be conjugate if 


A = Cee". 


where C is also an element of the group; multiplying this equation on the 
right by Cand on the left by C1, we have the converse equation B = C-1AC. 
An important property of conjugate elements is that, if A is conjugate to B, 
and B ito C, then-4 is conjugate*to C; for, if B= P4~AP, C=O7BO 
(P and Q being elements of the group), it follows that C = (PQ)-A(PQ). 
For this reason we can speak of sets of conjugate elements of a group. 
Such sets are called classes of conjugate elements, or simply classes, of the 
group. Each class is completely determined by any one element A of it; 
for, given A, we obtain the whole class by forming the products GAG~-}, 
where G is successively every element of the group (of course, this may 
give each element of the class several times). Thus we can divide the whole 
group into Classes; each element of the group can clearly appear in only 
one class. The unit element of the group is a class by itself, since for every 
element of the group CEG- -1 = E- If the group is Abelian, each of its 
elements 1s a Class by itself; since all the elements, by definition, commute, 
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each element is conjugate only to itself. We emphasize that a class of a group 
(not being £) is not a sub-group of it; this is evident from the fact that it 
does not contain a unit element. 

All the elements of a given class are of the same order. For, if z is the 
order of the element 4 (so that A” = E), then for a conjugate element 
B= CAC * weenave (CAC) = CAE *— £, 

Let H be a sub-group of G, and G, an element of G not belonging to H. 
It is easy to see that the set of elements G,HG,~*has all the properties of a 
group, i.e. it also is a sub-group of the group G. The sub-groups H and 
G,HG," are said to be conjugate; each element of one is conjugate to one 
element of the other. By giving G, various values, we obtain a series of 
conjugate sub-groups, which may partly coincide. It may happen that all 
the sub-groups conjugate to H are H itself. In this case H is called a normal 
divisor or invariant sub-group of the group G. Thus, for example, every 
sub-group of an Abelian group ts clearly a normal divisor of it. 

Let us consider a group A with z elements A, A’, A”,..., and a group 
B with m elements B, B’, B’’,..., and suppose that all the elements of A 
(apart from the unit E) are different from those of B but commute with 
them. If we multiply every element of group A by every element of group B, 
we obtain a set of nm elements, which also form a group. For, for any two 
elements of this set we have AB. A'B' = AA’. BB' = A” B”, i.e. another 
element of the set. The group of order nm thus obtained is denoted by 
A XB, and is called the direct product of the groups A and B. 

Finally, we shall introduce the concept of the zsomorphism of groups. 
Two groups A and B of the same order are said to be isomorphous if we can 
establish a one-to-one correspondence between their elemer.:s, such that, if 
the element B corresponds to the element A, and B’ to A’, then B” = BB’ 
corresponds to A” = AA’. Two such groups, considered in the abstract, 
clearly have identical properties, though the actual meaning of their elements 
may be different. 


§93. Point groups 


Transformations which appear in the symmetry group of a body of finite 
dimensions (in particular, a molecule) must be such that at least one point of 
the body remains fixed when any of these transformations is applied. In 
other words, all axes and planes of symmetry of a molecule must have at least 
one common point of intersection. For a successive rotation of the body 
about two non-intersecting axes or a reflection in two non-intersecting planes 
results in a translation of the body, which obviously cannot then be left 
unaltered. Symmetry groups having the above property are called point groups. 

Before going on to construct the possible types of point group, we shall 
explain a simple geometrical procedure whereby the elements of a group may 

a be easily divided into classes. Let Oa be some axis, and let the element A 


of the group be a rotation through a definite angle about this axis. Next, let G 
be a transformation (rotation or reflection) in the same group, which on being 


4 
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applied to the same axis Oa carries it to the position Ob. We shall show that 
the element B = GAG™ then corresponds to a rotation about the axis Ob 
through the same angle as that of the rotation about Oa to which the element 
A corresponds. For, let us consider the effect of the transformation GAG-! 
on the axis Ob itself. The transformation G~} inverse to G carries the axis 
Ob to the position Oa, so that the subsequent rotation A leaves it in this 
position; finally, G carries it back to its initial position. Thus the axis Ob 
remains fixed, so that B is a rotation about this axis. Since A and B belong 
to the same class, their orders are the same; this means that they effect 
rotations through the same angle. 

Thus we reach the result that two rotations through the same angle belong 
to the same class if there is, among the elements of the group, a transformation 
whereby one axis of rotation can be carried into the other. In exactly the same 
way, we can show that two reflections in different planes belong to the same 
class if some transformation in the group carries one plane into the other. 
The axes or planes of symmetry whose directions can be carried into each 
other are said to be equivalent. 

Some additional comments are necessary in the case where both rotations 
are about the same axis. The element inverse to the rotation C,* (k = 
1, 2, ...,2—1) about an axis of symmetry of the nth order is the element 
C," = C,,"-*, i.e. a rotation through an angle (n—k)27/n in the same 
direction or, what is the same thing, a rotation through an angle 2kz/n in 
the opposite direction. If, among the transformations in the group, there is 
a rotation through an angle 7 about a perpe.idicular axis (this rotation reverses 
the direction of the axis under consideration), then, by the general rule 
proved above, the rotations C,* and C,,-* belong to the same class. A 
reflection o, in a plane perpendicular to the axis also reverses its direction; 
however, it must be borne in mind that the reflection also changes the direction 
of rotation. Hence the existence of o, does not render C,” and C,-* conju- 
gate. A reflection oc, in a plane passing through the axis, on the other hand, 
does not change the direction of the axis, but changes the direction of rota- 
tion, and therefore C,~* = o,,C,*o,, so that C,” and C,~* belong to the same 
class if such a plane of symmetry exists. If rotations about an axis through the 
same angle in opposite directions are conjugate, we shall call it bilateral. 

The determination of the classes of a point group is often facilitated by the 
following rule. Let G be some group not containing the inversion J, and C; 
a group consisting of the two elements J and E. Then the direct product 
G x C, is a group containing twice as many elements as G; half of them are the 
Same as the elements of the group G, while the remainder are obtained by 
multiplying the latter by J. Since J commutes with any other transformation 
of a point group, it is clear that the group G xC; contains twice as many 
classes as G; to each class A of the group G there correspond the two classes 


A and AJ in the group GxC,. In particular, the inversion J always forms 
a class by itself. 


Let us Now go on to enumerate all possible point groups. We shall con- 
struct these by starting from the simplest ones and adding new elements of 
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symmetry. We shall denote point groups by bold italic Latin letters with 
appropriate sufhxes. 


I. C, groups 

The simplest type of symmetry has a single axis of symmetry of the nth 
order. The group C, is the group of rotations about an axis of the mth order. 
This group is evidently cyclic. Each of its n elements forms a class by itself. 
The group C, contains only the identical transformation E, and corresponds 
to the absence of any symmetry. 


II. Son groups 

The group Se, is the group of rotary reflections about a rotary-reflection 
axis of even order 2n. It contains 2n elements and is evidently cyclic. In 
particular, the group S, contains only two elements, E and J; it is also denoted 
by C;. We may note also that, if the order of a group is a number of the form 
2n = 4p+2, inversion is among its elements; it is clear that (S, p4) P" 
= C o = I. Such a group can be written as a direct product S; p42 = Copii 
xC,; it is also denoted by Cy,,41 ;. 


III. Caa groups 

These groups are obtamed by adding to an axis of symmetry of the nth 
order a plane of symmetry perpendicular to it. The group Cpp contains 
2n elements: n rotations of the group Cy, and n rotary-reflection trans- 
formations C,*c,, k= 1,2)... , (including the reflection Clon = Th). 
All the elements of the group commute, Le. it is Abelian; the number of 
classes is the same as the number of elements. If n is even (n = 2p), the group 
contains a centre 2f symmetry (since C,,?0, = Czop = J). The simplest 
group, Cia: contains only two elements, E and oc}; it is also denoted by C, 


IV. Cn, groups 

If we add to an axis of symmetry of the nth order a plane of symmetry 
passing through it, this automatically gives another n—1 planes intersecting 
along the axis at angles of z/n, as follows at once from the geometrical 
theoremt (91.7). The group Cn, thus obtained therefore contains 27 
elements: 7 rotations about the axis of the mth order, and n reflections oy in 
vertical planes. Figure 34 shows, as an example, the systems of axes and 
planes of symmetry for the groups C,,, and Cip 

To determine the classes, we notice that, because of the presence of planes 
of symmetry passing through the axis, the latter is bilateral. The actual 
distribution of the elements among the classes depends on whether n is 
even or odd. 

If n is odd (n = 2p+1), successive rotations Cp}; carry each of the 
planes successively into each of the other 2p planes, so that all the planes of 


+ Ina finite group, there cannot be two planes of symmetry intersecting at an angle which 
is not a rational fraction of 27. If there were two such planes, it would follow that there were 
an infinite number of other planes of symmetry, intersecting along the same line and obtained 
by reflecting one plane in the other ad infinitum In other words, if there are two such planes, 
there must be complete axial symmetry. 
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symmetry are equivalent, and the reflections in them belong to a single class. 
Among rotations about the axis there are 2p operations apart from the identity, 
and these are conjugate in pairs, forming p classes each of two elements 
(Cop+i* and Capri ¥, k = 1, 2, ..., p); moreover, E forms an extra class. 


Thus there are p+2 classes altogether. 
D; D, 


Fic. 34 


Cay Cay 


If, on the other hand, n is even (n = 2p), only every alternate plane can 
be interchanged by successive rotations Cap; two adjacent planes cannot be 
carried into each other. Thus there are two sets of p equivalent planes, and 
accordingly two classes of p elements (reflections) each. Of the rotations 
about the axis, C,,?” = E and C,,” = C, each form a class by themselves, 
while the remaining 2p —2 rotations are conjugate in pairs and give another 
p— 1 classes, each of two elements. The group Czp v thus nas p+3 classes 
altogether. 


V. D, growps 

If we add to an axis of symmetry of the nth order an axis of the second 
order perpendicular to it, this involves the appearance of a further n—1 
such axes, so that there are altogether n horizontal axes of the second order, 
intersecting at angles r/n. The resulting group D, contains 2n elements: 
n rotations about an axis of the nth order, and n rotations through an angle 
n about horizontal axes (we shall denote the latter by U,, reserving the notation 
C, for a rotation through an angle ~ about a vertical axis). Fig. 34 shows, as 
an example, the systems of axes for the groups D, and D,. 

In an exactly similar manner to case IV, we may verify that the axis of the 
nth order is bilateral, while the horizontal axes of the second order are all 
equivalent if n is odd, or form two non-equivalent sets if n is even. Con- 
sequently, the group Dap has the following p+3 classes: E, 2 classes each of 
Pp rotations U,, the rotation C,, and p—1 classes each of two rotations about 
the vertical axis. The group Dy,,;, on the other hand, has p+2 classes: 
E, 2p+1 rotations U,, and p classes each of two rotations about the vertical 
axis. 


An important particular case is the group Da. Its system of axes is 


composed of three mutually perpendicular axes of the second order. This 
group is also denoted by y. 
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VI. D,,, groups 

If we add to the system of axes of a group D, a horizontal plane of sym- 
metry passing through the n axes of the second order, n vertical planes 
automatically appear, each of which passes through the vertical axis and one 
of the horizontal axes. The group D,,, thus obtained contains 4n elements; 
besides the 27 elements of the group D,, it contains also n reflections c, 
and n rotary—reflection transformations C,*c,. Figure 35 shows the system 
of axes and planes for the group D,,. 


i ro) 
© 
Dan Dog O39 
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The reflection c, commutes with all the other elements of the group; 
hence we can write D,, as the direct product D,, = DaX C, where C, 1s 
the group consisting of the two elements E and o}. For even n the inversion 
operation is among the elements of the group, and we can also write 
Dop h ose "oy xC, 

Hence it follows that the number of classes in the group D,,, is twice the 
number in the group D,. Half of them are the same as those of the group 
D,, (rotations about axes), while the remainder are obtained by multiplying 
these by o}. The reflections o, in vertical planes all belong to a single class 
(if n is odd) or form two classes (if n is even). The rotary—reflection trans- 
formations o,C,* and o,C,~* are conjugate in pairs. 


VII. Daa groups 
There is another way of adding planes of symmetry to the system of axes 
of the group D,. This is to draw vertical planes through the axis of the nth 
order, midway between each adjacent pair of horizontal axes of the second 
order. The adding of one such plane again involves the appearance of another 
(n—1) planes. The system of axes and planes of symmetry thus obtained 
determines the group Dna. Figure 35 shows the axes and planes for the groups 
D,, and D,a- 
The group D, contains 47 elements. To the 2n e1ements of the group 
D, are added n reflections in the vertical planes (denoted by o g—the “‘diago- 
®nal” planes) and n transformations of the form G = Uca In order to 
ascertain the nature of these latter, we notice that the rotation U, can, by 
(91.6), be written in the form U, = opo where o, is a reflection in the verti- 
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cal plane passing through the corresponding axis of the second order. Then 
G =0)0,0, (the transformations o, op, alone are not, of course, among 
the elements of the group). Since the planes of the reflections o, and oc, 
intersect along an axis of the mth order, forming an angle (2k+1)m/2n, 
where k = 1, ..., (a—1) (since here the angle between adjacent planes is 
mj 2n), it follows that, by (91.6), we have ooa = G,,,?*+!. Thus we find that 
G = onCon2*-l = Son?*-}, i.e. these elements are rotary—reflection trans- 
formations about the vertical axis, which is consequently not a simple axis 
of symmetry of the mth order, but a rotary-reflection axis of the 2nth order. 

The diagonal planes reflect two adjacent horizontal axes of the second 
order into each other; hence, in the groups under consideration, all axes of 
the second order are equivalent (for both even and odd n). Similarly, all 
diagonal planes are equivalent. The rotary-reflection transformations 
Son7*+1 and So,-2*-! are conjugate in pairs. 

Applying these considerations to the group D,,, a we find that it contains 
the following 2p+3 classes: E, the rotation C, about the axis of the mth 
order, (p—1) classes each of two conjugate rotations about the same axis, 
one class of the 2p rotations U,, one class of 2p reflections c4, and p classes 
each of two rotary-reflection transformations. 

For odd n (= 2p +1), inversion is among the elements of the group; this 
is seen from the fact that, in this case, one of the horizontal axes is perpen- 
dicular to a vertical plane. Hence we can write Dap41 a = Dopi1XCj, 
so that the group D,,,; a contains 2p+4 classes, which are obtained at 
once from the p+ 2 classes of the group D,,,,). 
VILL. The group T (the tetrahedron group) 

The system of axes of this group is the system of axes of symmetry of a 
tetrahedron. It can be obtained by adding to the system of axes of the group 
V four oblique axes of the third order, rotations about which carry the three 
axes of the second order into one another. This system of axes is conveniently 
represented by showing the three axes of the second order as passing through 
the centres of opposite faces of a cube, and those of the third order as the 
spatial diagonals of the cube. Figure 36 shows the position of these axes in a 
cube and in a tetrahedron (one axis of each type is shown). 





T For we have 


Zk+1 = k 
SgSon Cg = ao,,C,,,” tg, = On 0a 2,7 * +16, = ChE = Son 2K 


= 
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The three axes of the second order are-mutually equivalent. The axes of 
the third order are also equivalent, since they are carried into one another by 
the rotations C,, but they are not bilateral axes. Hence it follows that the 
twelve elements in the group T are divided into four classes: E, the three 
rotations C,, the four rotations C, and the four rotations C,’. 


IX. The group T 

This group contains all the symmetry transformations of the tetrahedron. 
Its system of axes and planes can be obtained by adding to the axes of the 
group T planes of symmetry, each of which passes through one axis of the 





second order and two of the third order. The axes of the second order thereby 
become rotary-reflection axes of the fourth order (as in the case of the group 
Dog). This system is conveniently represented by showing the three rotary- 
reflection axes as passing through the centres of opposite faces of a cube, the 
four axes of the third order as its spatial diagonals, and the six planes of 
symmetry as passing through each pair of opposite edges (Fig. 37 shows one 
of each kind of axis and one plane). 

Since the planes of symmetry are vertical with respect to the axes of the 
third order, the latter are bilateral axes. All the axes and planes of a given 
kind are equivalent. Hence the 24 elements of this group are divided into 
the following five classes: E, eight rotations Cs and C}, six reflections in 
planes, six rotary—reflection transformations Sq and S43, and three rotations 
Core 
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X. The group T, 

This group is obtained from T by adding a centre of symmetry: T, 
= TXC;. As a result, three mutually perpendicular planes of symmetry 
appear, passing through each pair of axes of the second order, and the axes of 
the third order become rotary-—reflection axes of the sixth order (Fig. 38 
shows one of each kind of axis and one plane). 





The group contains 24 elements divided among eight classes, which are 
obtained at once from those of the group T. 


XI. The group O (the octahedron group) 
The system of axes of this group is the system of axes of symmetry of a 
cube: three axes of the fourth order pass through the centres of opposite 
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faces, four axes of the third order through spposite corners, and six axes of the 
second order through the midpoints of opposite edges (Fig. 39). 

It is easy to see that all the axes of a given order are equivalent, and each 
of them is bilateral. Hence the 24 elements are divided among the following 


five classes: E, eight rotations C, and C}, six rotations L, and, C.S, three 
rotations C, and six rotations C}. 


XII. The group O, 
This is the group of all symmetry transformations of the cube.t It is 
+ The groups T, Ta, Tn, O, O, are called cubic groups. 
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obtained by adding to the group O a centre of symmetry: O, = O xC}. 
The axes of the third order in the group O are thereby converted into rotary- 
reflection axes of the sixth order (the spatial diagonals of the cube); in 
addition, another six planes of symmetry appear, passing through each pair 
of opposite edges, and three planes parallel to the faces of the cube (Fig. 
40). The group contains 48 elements divided among ten classes, which 


= 


foot ar? a 





can be at once obtained from those of the group O; five classes are the same 
as those of the group O, while the remainder are: J, eight rotary—reflection 
transformations Sg and S¢5, six rotary—reflection transformations C4on, 
C0, about axes of the fourth order, three reflections o, in planes horizontal 
with respect to the axes of the fourth order, and six reflections o, in planes 
vertical with respect to these axes. 


XIII, XIV. The groups Y, Y, (the icosahedron groups) 


These groups occur only exceptionally in Nature as symmetry groups of 
molecules. Hence we shall here only mention that Y is a group of 60 rotations 
about the axes of symmetry of the icosahedron (a regular solid with twenty 
triangular faces) or of the pentagonal dodecahedron (a regular solid with 
twelve pentagonal faces); there are six axes of the fifth order, ten of the 
third and fifteen of the second. The group Yp is obtained by adding a centre 
of symmetry: Y, = Y x Cj, and is the complete group of symmetry trans- 
formations of the above-mentioned polyhedra. 

This exhausts all possible types of point group containing a finite number 
of elements. In addition, we must consider what are called continuous point 
groups, which contain an infinite number of elements. This we shall do in 


898. 


§94. Representations of groups 


Let us consider any symmetry group, and let 4, be some one-valued func- 
tion of the coordinates in the configuration space of the physical system 
concerned. Under the transformation of the coordinate system which 
corresponds to an element G of the group, this function is changed into 
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some other function. On performing in turn all the g transformations 
in the group (g being the order of the group), we in general obtain g different 
functions from yi. For certain %1, however, some of these functions 
may be linearly dependent. As a result we obtain some number f (< g) of 
linearly independent functions ya, Yo, ..., Yp which are transformed into 
linear combinations of one another under the transformations belonging to 
the group in question. In other words, as a result of the transformation G, 
each of the functions p; (i = 1, 2, 3, ... , f) is changed into a linear combina- 
tion of the form 


f 
a G kir 


where the G,, are constants depending on the transformation G. The array 
of these constants is called the matrix of the transformation.T 

In this connection it is convenient to regard the elements G of the group 
as operators acting on the functions ys,, so that we can write 


Goi; = E Guth (94.1) 


the functions y; can always be chosen so as to be orthonormal. Then the 
concept of the matrix of the transformation is the same as that of the matrix 
of the operator, in the form defined in §11: 


Gie = | oP Ge dg. (94.2) 


To the product of two elements G and H of the group there corresponds 
the matrix obtained from the matrices of G and H by the ordinary rule 
of matrix multiplication (11.12): 


(GH ix = 2 Gafı- (94.3) 


The set of matrices of all the elements in a group is called a representation 
of the group. The functions y}, ... , Yy with respect to which these matrices 
are defined are called the basis of the representation. The number f of these 
functions gives what is called the dimension of the representation. 

Let us consider the integrals [y;*yx, dg. Since the integration is taken 
over all space, it is evident that the values of the integrals are unchanged by 
any rotation or reflection of the coordinate system. That is, the symmetry 
transformations do not destroy the orthonormality of the base functions, 
and therefore the operators G are unitary (see §12).{ Accordingly, the 


matrices which represent the elements of a group in a representation with 
an orthonormalized basis are also unitary. 


t Since the functions $; are assumed one-valued, a definite matrix corresponds to each 
element of the group. 

t In this argument it is important that the integrals are either equal to zero (for i# k), or 
definitely not zero (for i = k) because the integrand |y;|? is positive. , 
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Suppose that we perform on the system of functions y1, ..., p the linear 
unitary transformation 


Yi = Shi. (94.4) 


This gives a new system of functions ¥j,..., Y's, which are also orthonormal 
(see §12).t If we now take, as the basis of the representation, the functions 
W'a we obtain a new representation of the same dimension. Such representa- 
tions, obtained from one another by a linear transformation of their base 
functions, are said to be equivalent; it is evident that they are not essentially 
different. 

The matrices of equivalent representations can be simply expressed in terms 
of one another. According to (12.7), the matrix of the operator G in the new 
representation is the matrix of the operator 


G' =S7GS (94.5) 


in the old representation. 

The sum of the diagonal elements (1.e. the trace) of the matrix representing 
an element G of a group is called its character; we shall denote it by x(G). 
It is a very important result that the characters of the matrices of equivalent 
representations are the same (see (12.11)). This circumstance gives par- 
ticular importance to the description of group representations by stating 
their characters: it enables us to distinguish at once the fundamentally 
different representations from those which are equivalent. Henceforward 
we shall regard as different representations only those which are not 
equivalent. 

If we take S in (94.5) to be that element of the group which relates the 
conjugate elements G and G’, we have the result that, in any given represen- 
tation of a group, the characters of the matrices representing elements of the 
same class are the same. 

The identical transformation corresponds to the unit element E of the 
group. Hence the matrix representing the latter is diagonal in every represen- 
tation, and the diagonal elements are unity. The character x(£) is con- 
sequently just the dimension of the representation: 


x(E) =f (94.6) 


Let us consider some representation of dimension f. It may happen that, 
as a result of a suitable linear transformation (94.4), the base functions 
divide into sets of fi, fa .. functions (f +fa+.-. =f), in such a way that, 
when any element of the group acts on them, the functions in each set are 
transformed only into combinations of themselves, and do not involve 
functions from other sets. In such a case the representation in question is 
said to be reducible. 

If, on the other hand, the number of base functions that are transformed 
only into combinations of themselves cannot be reduced by any linear trans- 
formation of them, the representation which they give is said to be irreducible, 


+ From (12.12), the unitarity of the transformations implies that the sum of the squared 
moduli of the base functions is invariant. 
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Any reducible representation can, as we say, be decomposed into irreducible 
ones. This means that, by the appropriate linear transformation, the base 
functions divide into several sets, of which each is transformed by some 
irreducible representation when the elements of the group act on it. Here 
it may be found that several different sets transform by the same irreducible 
representation; in such a case this irreducible representation is said to be 
contained so many times in the reducible one. 

Irreducible representations are an important characteristic of a group, and 
play a fundamental part in all quantum-mechanical applications of group 
theory. We shall give the chief properties of irreducible representations. t 

It may be shown that the number of different irreducible representations 
of a group is equal to the number r of classes in the group. We shall distin- 
guish the characters of the various irreducible representations by indices; 
the characters of the matrices of the element G in the representations are 
xP(G), x®(G), ... » x¥(G). 

The matrix elements of irreducible representations satisfy anumber of ortho- 
gonality relations. First of all, for two different irreducible representations 
the relations 


5 GO p GP m = 0 (94.7) 


hold, where « and £ (a # f) refer to the two irreducible representations, and the 
summation is taken over all the elements of the group. For any irreducible 
representation the relations 


z CO. CO mn = Faden (94.8) 


Q 


hold, i.e. only the sums of the squared moduli of the matrix elements are not 
Zero: 


E [Cen]? = ely 
The relations (94.7), (94.8) can be combined in the form 


2 CACA m" = “aphid (94.9) 


x 


In particular, we can obtain from this an important orthogonality relation 
for the characters of the representations. Summing both sides of equation 
(94.9) over equal values of the suffixes i, k and lL, m, we have 


2 XI%(G)xA(G)* = 8.29. (94.10) 
For « = £ we have 
Z IX'XG)|? =g, 


t The proof of these properties may be found in any textbook on group theory. 
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i.e. the sum of the squared moduli of the characters of an irreducible represen- 
tation is equal to the order of the group. We may notice that this relation 
can be used as a criterion of the irreducibility of a representation; for a 
reducible representation, this sum is always greater than g (for instance, It 1S 
ng if the representation contains n different irreducible parts). 

It also follows from (94.10) that the equality of the characters of two 
irreducible representations is not only a necessary but also a sufficient con- 
dition for them to be equivalent. 

Since the characters of elements of the same class are equal, the sum 
(94.10) actually contains only z independent terms, and can be written in 
the form 


dgcx@(C)xO(C)* = Boag (94.11) 


where the summation is over the r classes of the group (arbitrarily denoted 
by C) and gc is the number of elements in class C. 

Since the number of irreducible representations is equal to the number of 
classes, the quantities fec =-V(gclg)x(C) form a square matrix of re 
quantities. 

The orthogonality relations for the first suffix, 


È facheo™ = Sap, 
then automatically give those for the second suffix, 
2 fac ac* = bcc. 
Hence, besides (94.11), we have 


E xUC)xUC)* = (glec)8ce-. (94.12) 


Among the irreducible representations of any group there is always a 
trivial one, given by a single base function invariant under all the transforma- 
tions in the group. This one-dimensional representation is called the unit 
representation; in it, all characters are unity. If one of the representations 
in the orthogonality relation (94.10) or (94.11) is the unit representation, 
the other is such that 


X x(G) = Zecx(C) = 0, (94.13) 


ie. the sum of the characters of all the elements of the group is zero for 
every irreducible representation. f ' 
The relation (94.10) enables any reducible representation to be very easily 
decomposed into irreducible ones if the characters of both are known. Let 
x(G) be the characters of some reducible representation of dimension f, 
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and let the numbers a‘), a), ... ,a™ indicate how many times the cor- 
responding irreducible representations are contained in it, so that 


a df, =f, (94.14) 


where fg are the dimensions of the irreducible representations. Then the 
characters y(G) can be written 


r 


Maa E AG). (94.15) 


Multiplying this equation by y©(G)* and summing over all G, we have by 
(94.10) 


1 
a) == ZCK: (94.16) 


Let us consider a representation of dimension f = g, given by the_g 
functions Gy, yy being some general function of the coordinates (so that 
all the g functions Gy obtained from it are linearly independent); such a 
representation is said to be regular. It is clear that none of the matrices of 
this representation will contain any diagonal elements, with the exception of 
the matrix corresponding to the unit element; hence x(G) = 0 for G # E, 
while x(Z) = g. Decomposing this representation into irreducible ones, we 
have for the numbers a‘), by (94.16), the values a@ = (I/g)gf™ = f, Le. 
each irreducible representation 1s contained in the reducible one under 
consideration as many times as its dimension. Substituting this in (94.14), 
we find the relation 


ARR T ... +/7 = 8; (94.17) 


the sum of the squared dimensions of the irreducible representations of a 
group is equal to its order.t Hence it follows, in particular, that for Abelian 
groups (where 7 = g) all the irreducible representations are of dimension one 
=f = =f, = 1). 

We may also remark, without proof, that the dimensions of the irreducible 
representations of a group divide its order. 

In practice, the decomposition of a regular representation into irreducible 
parts is made by means of the formula 


x Fi x 
pO = > GrO*Ĝy. 94.18 
r > k yp ( ) 


It is easy to verify that the functions ¥;(i = 1, 2, ..., f,) represented by 
this formula with a given value of & are transformed according to 


enn == 2 Guy, 


+ It may be mentioned that, for point groups, equation 


t i ' i (94.17) for given r and g can in 
practice be satisfied in only one way by a set of integers f, 


> a=} i 
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i.e. they are a basis of the ath irreducible representation. By giving various 
values to & we obtain in this way f, different sets of base functions yj for 
one irreducible representation, in accordance with the fact that each 
irreducible representation appears f, times in the regular representation. 

Any function ys may be written as a sum of functions transformed by the 
irreducible representations of the group. This problem is solved by the 
formulae 


_ © gt ALS Go 
yp Pay oa: pi) aoe (94.19) 


To prove this, we substitute the second formula in the first and calculate 
the sum over 7, obtaining 


p= DRE . Ĝy. (94.20) 


Since the dimensions f, coincide with the characters y()(E) of the unit 
element of the group, we can use the orthogonality relation (94.12) to show 
that the sum in (94.20) is non-zero (and equal to g) only if G is the unit 
element of the group. Hence the right-hand side of (94.20) is identically 
equal to w. 

Let us consider two different systems of functions ¥,,..., yy © and 
pA, ae, p, O, which form two irreducible representations of a group. 
By forming the products 5,4) we obtain a system of fa fg new functions, 
which can serve as the basis for a new representation of dimension f, fp. 
This representation is called the direct product or Kronecker product of the 
other two; it is irreducible only if f, or fẹ is unity. It is easy to see that the 
characters of the direct product are equal to the products of the characters 
of the two component representations. For, if 


Cy = E Guh, Gp? = E Gm À, 
then 
OPORA = E GC ma PY; 
hence we have for the characters, which we denote by (x™xx¥)(G), 
(xy xxX)(G) = 5 Cu GP E z Gi p Gi, 


i.e. 


(xx O)(G) = xXG)x(G). (94.21) 


The two irreducible representations so multiplied may, in particular, be 
the same; in this case we have two different sets of functions yy, -..» Ys 
and ¢,, --- $y giving the same representation, while the direct product of 
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the representation with itself is given by the f? functions ýp, and has the 
characters 


(xxx)(G) = KG). 


This reducible representation can be at once decomposed into two represen- 
tations of smaller dimension (although these are, in general, themselves 
reducible). One of them is given by the 4f(/+1) functions 4.4,+1,d,, the 
other by the $/(f—1) functions 4.4¢,—%,¢,,7 #% k; it is evident that the func- 
tions in each of these sets are transformed only into combinations of them- 
selves. The former is called the symmetric product of the representation with 
itself, and its characters are denoted by the symbol [y?](G); the latter is 
called the antisymmetric product, and its characters are denoted by {x3} (G). 
To determine the characters of the symmetric product, we write 


Gihidiet Prp) = R> CGuCG mepi p mt Ymp) 
=4 A, (CuG met G mC uim t Y mfr): 


Hence we have for the character 


KG) =% 2 (GiiGrrt GixGri). 


But E Gu = x(G), and E GiGr = x(G?); thus we finally obtain the 


formula 
ELKO = HE(G)P+x(G)}, (94.22) 


which enables us to determine the characters of the symmetric product of a 
representation with itself from the characters of the representation. In an 
exactly similar manner, we find for the characters of the antisymmetric 
product the formulat 


XG) = HIG) P—x(G?)}. (94.23) 


If the functions ¥, and ¢, are the same, we can evidently construct from 
them only the symmetric product, formed by the squares y¥,* and the pro- 
ducts Yip i # k. In applications, symmetric products of higher orders are 
also encountered; their characters may be obtained in a similar manner. 

An important property of direct products is the following. The decom- 
position of the direct product of two different irreducible representations into 
irreducible parts contains the unit representation (and only once) only if 
the representations multiplied together are complex conjugates. For real 
representations, the unit representation is present only in the direct product 
of an irreducible representation with itself, and is of course in the symmetric 
part. In order to know whether the unit representation is present in the 


+ It is useful to note that, 
equal to the determinants of 
calculation. 


for representations of dimension 2, the characters {x?}(G) are 
the linear transformations G, as can easily be shown by direct 
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representation (94.21), we simply sum its characters with respect to G and 
divide the result by the order g of the group, in accordance with (94.16). The 
conclusion stated then follows at once from the orthogonality relations 
(94.10). 

Finally, we shall make a few remarks regarding the irreducible represen- 
tations of a group which is the direct product of two other groups (not 
to be confused with the direct product of two representations of the same 
group). If the functions y;© give an irreducible representation of the group 
A, and the functions ¢,) give one of the group B, the products ¢,4,™ 
are the basis of an f, fg-dimensional representation of the group A xB, and 
this representation is irreducible. The characters of this representation are 
obtained by multiplying the corresponding characters of the original represen- 
tations (cf. the derivation of formula (94.21)); to an element C = AB of 
the group A xB there corresponds the character 


x(C) = x A)xF(B). (94.24) 


Multiplying together in this way all the irreducible representations of the 
groups A and B, we obtain all! the irreducible representations of the group 
A xB. 


§95. Irreducible representations of point groups 


Let us pass now to the actual determination of the irreducible represen- 
tations of point groups. The great majority of molecules have axes of 
syrrmetry only of the second, third, fourth or sixth order. Hence we shall 
not consider the icosahedron groups Y, Yn; we shall examine the groups 
Cn, Can, Cnv, Dn, Dan only for the values n = 1, 2, 3, 4, 6, and the groups 
Sen, Dna only for n = 1, 2, 3. 

The characters of the representations of these groups are shown in Table 7. 
Isomorphous groups have the same representations and are given together. 
The numbers in front of the symbols for the elements of a group in the upper 
rows show the numbers of elements in the corresponding classes (see §93). 
The left-hand columns show the conventional names usually given to the 
representations. The one-dimensional representations are denoted by the 
letters A, B, the two-dimensional ones by E, and the three-dimensional 
ones by F; the notation E for a two-dimensional irreducible representation 
should not be confused with the unit element of a group.t The base functions 
of A representations are symmetric, and those of B representations antisym- 
metric, with respect to rotations about a principal axis of the nth order. 
The functions of different symmetry with respect to a reflection cp are 
distinguished by the number of primes (one or two), while the suffixes 
g and u show the symmetry with respect to inversion. Beside the symbols 
for the representations are placed the letters x, y, z; these show the repre- 


+ The reason why two complex conjugate one-dimensional representations are shown as 
one two-dimensional one is explained in §96. 
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sentations by which the coordinates themselves are transformed. The z-axis 
is always taken along the principal axis of symmetry. The letters e and w 


denote 
€ = eeni/3 
e+e? = —1, 


we—-w = —1, 


The simplest problem is to determine the irreducible representations for 


TABLE 7 


Characters of irreducible representations of point groups 


—_ eee 


C; 
C: 





Ag A; z 


Au; x,y,z Bix 





Caa E C, 
Ca se E 
D, E Cë 
Ag Ay;2z A 1 1 
By B; y B33x 1 —1 
Au; z A, By; 2 i 1 


E;xtty E;xtiy l 














E T C, E C; Ç; 
E Ci 
E a A;z 1 1 1 
1 ce c 
E; xiy 
1 1 1 e c 
1 —1 
Cee E 2C; 3c; 
8 D, E 2C; 30; 
Aiz A, 1 1 1 
A, Ay; z 1 1 -1 





Ci EC Cs CG Cp C.’ 
EA 1 1 1 1 1 1 
B 1 —1 1 —1 1 —1 

{ 1 ow ew 1 ww? —w 
E, | 
l =w a? 1 —w w 
1 w wt J] —w =w? 
Ez: xiy 
1 —w? -w —] wt w 
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TABLE 7—continued 


Cor l EnO 26, sae. Gat. 
D, | E °C, 26°28) 2U; 

D,a E °C, "OS 20, We, 

A32 A, Ay 1 1 1 1 1 
A, Ass z A, 1 1 1 —1 —1 
B, B; B, 1 1 -1 1 —1 
B, B: a 1 1 —1 —1 1 


Bex,y Eryo Ex, Y 2 —2 0 0 0 








De E € 2G; 20; 3U: IU; 
Cee el, 2O Ley: ee “Sa 
Dan E oy 2C, 2S; 3U, 3o; 
Ay Ay;2 A, | 1 1 1 1 1 1 
Asz As A,’ 1 1 1 1 —1 -1 
B, B; A” 1 —-1 1 -!1 1 —1 
B, B, E od | 1 —1 1 —1 ==! 1 
E; E, Ex} 2 2 -1 —i 0 0 


Bik i ts Xp 








O E 8C, 3C, 6C, 6C, 

T E 3C, 4C, 4C;? T: E 8C, 3C; 6046S, 
A ti rpa A, A, | a ee Pe ee 
1 1 e æ As A; 1 1 1-1-1 

: | 1 1 æ e E E 2 =1 2 Olt 
F; x,y,z 3 —1 0 0 F, Fx yz |3 0 —1 1 =! 
Fixy z Fi 3 0 —1 —1 1 


DSO 
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the cyclic groups (Cn, Sn). A cyclic group, like any Abelian group, has only 
one-dimensional representations. Let G be a generating element of the group 
(i.e. one which, on being raised to successive powers, gives all the elements 
of the group). Since G? = E (where g is the order of the group), it is clear 
that, when the operator G acts on a base function W, the latter can be multi- 
plied only by 11/2, i.e. 


Cy = eto, (k =1,2,.., 2). 


The group Ce, (and the isomorphous groups C and D2) is Abelian, so 
that all its irreducible representations are one-dimensional, and the characters 
can only be + 1 (since the square of every element is E£). 

Next we consider the group C}, As compared with the group C,, the 
reflections ø, in vertical planes (all belonging to one class) are here added. 
A function invariant with respect to rotation about the axis (a base function 
of the representation A of the group C,) may be either symmetric or anti- 
symmetric with respect to the reflections o,. Functions multiplied by e 
and e? under the rotation C,, on the other hand (base functions of the com- 
plex conjugate representations E), change into each other on reflection. f 
It follows from these considerations that the group C,, (and D,, which is 
isomorphous with it) has two one-dimensional irreducible representations 
and one two-dimensional, with the characters shown in the table. The fact 
that we have indeed found all the irreducible representations may be seen 
from the result 174 1242? = 6, which is the order of the group. 

Similar considerations give the characters of the representations of other 
groups of the same type (Cie C,,). 

The group T is obtained from the group De = V by adding rotations about 
four oblique axes of the third order. A function invariant with respect to 
transformations of the group V (a basis of the representation A) can be 
multiplied, under the rotation C3, by 1, e or «2. The base functions of the 
three one-dimensional representations Bı, B2, B3 of the group V change into 
one another under rotations about the axes of the third order (this is seen. 
for example, if we take as these functions the coordinates x, y, z themselves). 
Thus we obtain three one-dimensional irreducible representations and one 
three-dimensional (12+ 124 12432 = 12). 

Finally, let us consider the isomorphous groups O and T4. The group 
T 4 is obtained from the group T by adding reflections o, in planes each of 
which passes through two axes of the third order. A base function of the 
unit representation A of the group T may be symmetric or antisymmetric 
with respect to these refiections (which all belong to one class), and this 
gives two one-dimensional representations of the group T,. Functions 
multiplied by e or e? under a rotation about an axis of the third order (the 


t For the point group C, we can, for example, take as the functions y the functions e*¢, 
Roce | ade “My where ġ is the angle of rotation about the axis, measured from some fixed direction. 
t These functions may, tor example, be taken as py = ett, p2 = e- bo, 


À ; = On reflection in a 
vertical plane, ģ changes sign 
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basis of the complex conjugate representations E of the group T) change into 
each other on reflection in a plane passing through this axis, so that one 
two-dimensional representation is obtained. Finally, of three base functions 
of the representation F of the group T, one is transformed into itself on 
reflection (and can either remain unaltered or change sign), while the other 
two change into each other. Thus we have altogether two one-dimensional 
representations, one two-dimensional and two three-dimensional.t 

The representations of the remaining point groups in which we are inter- 
ested can be obtained immediately from those already given, if we notice 
that the remaining groups are direct products of those already considered 
with the group C; (or C,): 


C= €,xC, Da = D.xC; Daa = DXG; 

Cin = CXC; Dy, = DyxC; Den = Dx; 

Cen = CXC; Se = CXC; T, = TXC; 
O0, =OxC, 


Each of these direct products has twice as many irreducible representations 
as the original group, half of them being symmetric (denoted by the suffix g) 
and the other half antisymmetric (suffix u) with respect to inversion. The 
characters of these representations are obtained from those of the representa- 
tions of the original group by multiplying by +1 (in accordance with the 
rule (94.24)). Thus, for instance, we have for the group Daa the repre- 
sentations: 


De E aC, BÜ I 2S, 304 
a l 1 1 1 1 1 
Ag 1 bi 1 LP uot 
E, | 0 i FEI 0 
By 1 1 a = = 
A l { =L ca Tai 1 
E, Ja s Dae S2 1 0 


$96. Irreducible representations and the classification of terms 


The quantum-mechanical applications of group theory are based on the 
fact that the Schrödinger’s equation for a physical system (an atom or 
molecule) is invariant with respect to symmetry transformations of the 


+ Irreducibie representations of higher dimension (4 and 5) occur in the icosahedron 
groups. 
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system.f It follows at once from this that, on applying the elements of a group 
to a function satisfying Schrédinger’s equation for some value of the energy 
(an eigenvalue), we must again obtain solutions of the same equation for the 
same value of the energy. In other words, under a symmetry transformation 
the wave functions of the stationary states of the system belonging to a given 
energy level transform into linear combinations of one another, i.e. they give 
some representation of the group. An important fact is that this representa- 
tion is irreducible. For functions which are invariably transformed into linear 
combinations of themselves under symmetry transformations must belong 
to the same energy level; the equality of the eigenvalues of the energy cor- 
responding to several groups of functions (into which the basis of a reducible 
representation can be divided), which are not transformed into combinations 
of one another, would be an improbable coincidence. f 

Thus, to each energy level of the system, there corresponds some irreduc- 
ible representation of its symmetry group. The dimension of this represen- 
tation determines the degree of degeneracy of the level concerned, i.e. the 
number of different states with the energy in question. The fixing of the 
irreducible representation determines all the symmetry properties of the 
given state, i.e. its behaviour with respect to the various symmetry trans- 
formations. 

Irreducible representations of dimension greater than one are found only 
in groups containing non-commuting elements; Abelian groups have only 
one-dimensional irreducible representations. It is apposite to recall here that 
the relation between degeneracy and the presence of operators which do not 
commute with one another (but do commute with the Hamiltonian) has 
already been found above from considerations unrelated to group theory 
(§10). 

The following important reservation should be made regarding all these 
Statements. As has already been pointed out (§18), the symmetry (valid in 
the absence of. a magnetic field) with respect to a change in the sign of the 
time has, in quantum mechanics, the result that complex conjugate wave 
functions must belong to the same eigenvalue of the energy. Hence it follows 
that, if some set of functions and the set of complex conjugate functions give 
different (non-equivalent) irreducible representations of a group, these two 
complex conjugate representations must be regarded as forming together a 
single “physically irreducible” representation of twice the dimension. 
This will be assumed below. In the preceding section we had examples of 
such representations. Thus the group C3 has only one-dimensional repre- 
sentations, however, two of these are complex conjugates, and correspond 
physically to doubly degenerate energy levels. (In the presence of a magnetic 
field there is no symmetry with respect to a change in the sign of the time, 





t The methods of group theory were first applied in quantum mechanics by E. P. Wigner 
(1926). 

t Provided that there is no special reason for this. 
‘accidental’ degeneracy that arises because the Hamil 
symmetry than the purely geometrical symmetry consi 
end of §36). 


Reference may be made here to the 
tonian of a system can have a higher 
dered in the present chapter (see the 
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and hence complex conjugate representations correspond to different energy 
levels.) 

Let us suppose that a physical system is subjected to the action of some 
perturbation (i.e. the system is placed in an external field). The question 
arises to what extent the perturbation can result in a splitting of the degener- 
ate levels. The external field has itself a certain symmetry. If this symmetry 
is the same as or higher) than that of the unperturbed system, the symmetry 
of the perturbed Hamiltonian H = H,+V is the same as the symmetry 
of the unperturbed operator Hy. It is clear that, in this case, no splitting 
of the degenerate levels occurs. If, however, the symmetry of the pertur- 
bation is lower than that of the unperturbed system, then the symmetry of 
the Hamiltonian A is the same as that of the perturbation 7. The wave 
functions which gave an irreducible representation of the symmetry group 
of the operator H, will also give a representation of the symmetry group 
of the perturbed operator H, but this representation may be reducible, and 
this means that the degenerate level is split. 

We shall show by means of an example how the mathematical techniques 
of group theory enable us to solve the problem of the splitting of any given 
level. 

Let the unperturbed system have symmetry Ta, and let us consider a 
triply degenerate level corresponding to the irreducible representation Fe 
of this group. The characters of this representation are 


Ẹ 8C3 3C2 60g 6S4 
3 0 —1 l —1 


Let us assume that the system is subjected to the action of a perturbation 
with symmetry C3, (with the third-order axis coinciding with one of those of 
the group Ta). The three wave functions of the degenerate level give a 
representation of the group C3, (which is a sub-group of the group Ta), and 
the characters of this representation are equal to those of the same elements 
in the original representation of the group Ta, l.e. 


E 2C3 36y 
3 0 1 


This representation, however, is reducible. Knowing the characters of the 


+ Strictly speaking, the fact that the characters are real (i.e, that the complex conjugate 
representations are equivalent) is not a sufficient condition for the possibility of choosing real 
base functions of the representation of the group. For irreducible representations of point 
groups, however, it is sufficient (though not for the “double” point groups; see §99). 

1 For example, ın the case of the energy levels of the d and f shells of ions in a crystal lattice 
which interact slightly with the surrounding atoms, the perturbation (the external field) is the 
field acting on an ion due to the other atoms. 

I If a symmetry group H 1s a sub-group of the group G, we say that H corresponds to a lower 
symmetry and G to a higher symmetry. lt is evident that the symmetry of the sum of two ex- 
pressions, one of which has the symmetry of G and the other that of H, is the lower symmetry, 
that of H. 
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irreducible representations of the group C3y, it is easy to decompose it into 
irreducible parts, using the general rule (94.16). Thus we find that it consists 
of the representations A; and E of the group Czy. The triply degenerate 
level F2 is therefore split into one non-degenerate level A, and one doubly 
degenerate level E. If the same system is subjected to the action of a per- 
turbation of symmetry Czv, which is also a sub-group of the group Ta, then 
the wave functions of the same level Fe give a representation with characters 


E Co Gs cs 


-r 


3 —1 1 l 


Decomposing this into irreducible parts, we find that it contains the repre- 
sentations A), B1, Be. Thus in this case the level is completely split into three 
non-degenerate levels. 


§97. Selection rules for matrix elements 


Group theory not only enables us to carry out a classification of the terms 
of any symmetrical physical system, but also gives us a simple method of 
finding the selection rules for the matrix elements of the various quantities 
which characterize the system. 

This method is based on the following general theorem. Let ¢,@ be one 
of the base functions of an irreducible (non-unit) representation of asymmetry 
groun Then the integral of this function over all spacet vanishes identically: 


Í iO dg = 0. (97.1) 


The proof is based on the evident fact that the integral over all space is 
invariant with respect to any transformation of the coordinate system, 
including any symmetry transformation. Hence 


Í p dg = Í Gp dg = Í es Gif PE dq. 


We sum this equation over all the elements of the group. The integral on 
the left is simply multiplied by g, the order of the group, and we have 


g | ydg =F Í W E Gis dq. 


However, for any non-unit irreducible representation we have identically 
> Gp; = 0: 

this isa particular case of the orthogonality relations (94.7), when one of 

the irreducible representations is the unit representation. This proves the 


theorem. 


t That is, the configuration space of the Physical system concerned. 
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If #) is a function belonging to the basis of some reducible representation 
of a group, the integral f ¢ dg will be zero except when this representation 
contains the unit representation. This theorem is a direct consequence of 
the previous one. 

The matrix elements of a physical quantity f are given by the integrals 


CBR] f Jai) = | px fei dq, (97.2) 


~ 


where the indices « and £ distinguish different energy levels of the system, 
and the suffixes 1, k denumerate wave functions belonging to the same 
degenerate level.| We denote the irreducible representations of the symmetry 
group of the system concerned that are given by the functions %;) and 
Y ® by the symbols D® and D\), and by Dy the representation of the same 
group that corresponds to the symmetry of the quantity f; this representation 
depends on the tensor character of f. For example, if f is a true scalar, then 
its operator f is invariant under all the symmetry transformations, and Dy is 
the unit representation. The same occurs for a pseudoscalar quantity if 
the group contains only axes of symmetry, but if there are also reflections, 
Dy; is not the unit representation, though its dimension is unity. If f is a 
vector, then Dy is a representation given by the three vector components 
that are transformed into combinations of each other; this representation 1s 
in general different for polar and axial vectors. 

The products yP fy;®© give the representation that is the direct product 
DA x Dy x D®. The matrix elements are non-zero if this representation 
contains the unit representation or, equivalently, if the direct product 
D‘) x D® contains Dy. In practice, it is more convenient to decompose 
into irreducible parts the product D) x Dy; this gives us immediately all 
the types D(® of states for transitions into which (from a state of the type 
D) the matrix elements are not zero. 

Inthe simplest case of a scalar quantity, for which Dy is the unit representa- 
tion, it then follows immediately that the matrix elements are non-zero only 
for transitions between states of the same type: the direct product D™ x DY 
of two different irreducible representations does not contain the unit 
representation, but the latter is always present in the direct product of an 
irreducible representation with itself. This is most general statement of a 
theorem of which particular cases have already been met with. 

The matrix elements diagonal with respect to energy, i.e. those for transi- 
tions between states belonging to the same term (as opposed to transitions 
between states belonging to two different terms of the same type), need special 
treatment. In this case we have only one set of functions #1), p2, ..., not 
two different ones. The selection rules here are found by different methods, 
depending on the behaviour of the quantity f under time reversal. 


+ Since the base functions can be taken as real when “physically irreducible” represen- 
tations are used, we do not distinguish in (97.2) between the wave functions and their complex 


conjugates. 
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Let us consider a state described by a wave function of the form yẹ = 
E cyi_*). The mean value of f in this state is given by the sum 


f = 2 CEC Caki f aiy. 


In the state with the complex conjugate wave function $* = È aj*h, 
we have 


7 = 2 cpe" Cakj f lai) 
== D ceg" <ai| f |ak>. 


If f is invariant under time reversal, the two states not only belong to the 
same energy level but must also have the same value of f. Since the co- 
efficients c; are arbitrary, this means that 


<ak| flot> = <ai| flak. 


Hence, in order to find the selection rules, we must consider not the direct 
product D® x D®) as a whole, but only its symmetric part [D‘)2}; there are 
non-zero matrix elements if [D2] contains Dy. 

If, however, f changes sign under time reversal, the change from ¢ to p* 
has to be accompanied by a change in the sign of f. Hence we find by the 
same method that 


<ak| f jai) = —<oal flak). 


In this case, therefore, the selection rules are determined by the decom- 
position of the antisymmetric part of the direct product, {D2}. 


PROBLEMS 
PROBLEM 1. Find the selection rules for the matrix elements of the electric and magnetic 
dipole moments d and u when symmetry O is present. 
SOLUTION. The group O includes no reflections; the polar vector d and the axial vector u 
are therefore transformed by the same irreducible representation, F1. The decompositions of 
the direct products of F, with the other representations of the group O are 


romp eG ZF Fixa =a, RRE = Fit Fa (1) 
FLXF, = A FE+F tFn FixF, = AgtE+F tF 


Hence the non-zero non-diagonal (with respect to energy) matrix elements are those for the 
transltions 


Fie A,, E, Fa Fe; Fie da, E, Fy, Fa 





t The product [D0] always contains the unit representation, so that the diagonal elements 
(and non-diagonal elements between states of the same type) are non-zero for a scalar quantity. 
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The symmetric and antisymmetric products of the irreducible representations of the group O are 


[4:2] = [A422] = Ar, [E2) = AE F] = [Fo2] = Ay+E+ Fa, m 
f 


IE?} = do, Fi} = {F2} = Fa 
The symmetric products do not contain F;; hence there are no diagonal (with respect to energy) 
matrix elements of the vector d (which is invariant under time reversal). The magnetic moment, 
which changes sign under time reversal, has diagonal matrix elements for the states Fy and Fo. 
PROBLEM 2. The same as Problem 1, but for symmetry Dag. 
SOLUTION. The vectors d and p have different transformation laws in the group D3a: 
d, Y d; ~Eu, dz ~A2n, 
Ma, Ky ~E, z~o, 


here and in the Problems below, the symbol ~ stands for the words “‘is transformed by the 
representation’. We have 


EX 4, = E,X4,, =E,, EYXAy ~EuXan = E,; 


(3) 
E, XE, = Ay+An+E£,, ELXE&, = Aut Art En- 


Hence the non-diagonal matrix elements of dz, dy are non-zero for the transitions Fy,+>.4 49, 
Arg, Eg; Ege Aru, Azu, Eu. In the same way we find the selection rules 


for dz: Arg Azu; Arg Ayu; E; =E; 
for Ha» Hy: Ey Ag, Ady, Ey: Eu Au, Azu, Eu; 
for Hz: Arig Arg; Aru A u; E= E3; Eu Eu. 


The symmetric and antisymmetric products of the irreducible representations of the group 
Dza are 


[Arg] = [41°] = [429°] = (Aru?) = Arg, 


(4) 
[Eo?) = [Eu®?] = Eg+ Arg (Eg? = {Eu®} = Ag. 


Hence we see that there are no diagonal (with respect to energy) matrix elements for any of the 
components d; for the vector u, there are diagonal matrix elements of pz for transitions between 
states belonging to a degenerate level of the type Ey or Ey. 


PROBLEM 3. Find the selection rules for the matrix elements of the electric quadrupole 
moment tensor Oj, when symmetry O ts present. 


SoLuTION. The components of the tensor Qi (a symmetrical tensor with the sum Qk; equal 
to zero) with respect to group O are transformed by the laws 


Ory, Ozz, Oyz ~F., Orr o EOyy T e?Ozz, Ozz + eOyy T eQzz ~E 
(€ = g?ril3), 


Decor posing the direct products of Fz and E with all the representations of the group, we find 
the selection rules for the non-diagonal matrix elements: 


for Ory, Orz, Qvz: FAs, E, Fi, Fo; Foo Ay, E, Fi, F2; 
for Orr Oyy, Ozz: EÅ], Ao, E-: FiF), Fo; Foe F3. 


‘The diagonal matrix elements exist (as we see from (2)) in the following states: 


for Ory, Ozz, Oyz: Fi, Fo, 
for Orr, Qyy: Ozz: E Firg 
PROBLEM 4. The same as Problem 3, but for symmetry Daa. 
SOLUTION. The transformation laws of the components Qi with respect to the group 


Dsa are 
Qre~Aryi Orr- Oun Quy~Eoi Ore, Oye E; 
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Q2- behaves as a scalar. Decomposing the direct products of E, with all the representations of 
the group, we find the selection rules for the non-diagonal matrix elements of the remaining 
components Qip: i 


Ey ig, Sloan, E3; Ea diy, Ay, Ez: 


The diagonal elements are non-zero (as we see from (+)) only for the states E, and Ex. 


§98. Continuous groups 


As well as the finite point groups enumerated in §93, there exist also what 
are called continuous point groups, having an infinite number of elements. 
These are the groups of axial and spherical symmetry. 

The simplest axial symmetry group is the group Ce, which contains rota- 
tions C(¢) through any angle ¢ about the axis of symmetry; this is called the 
two-dimensional rotation group. It may be regarded as the limiting case of 
the groups C, as n -> œ. Similarly, as limiting cases of the groups Ca» 
Caos Dns Dap we obtain the continuous groups Cor Covi Da: Dah. 

A molecule has axial symmetry only if it consists of atoms lying in a straight 
line. If it meets this condition, but is asymmetric about its midpoint, its 
point group will be the group Co», which, besides rotations about the axis, 
contains also reflections o, in any plane passing through the axis. If, on 
the other hand, the molecule is symmetrical about its midpoint, its point 
group will be Don = C,,,XC;. The groups Cos Cors Da cannot appear as 
the symmetry groups of a molecule. 

The group of complete spherical symmetry contains rotations through any 
angle about any axis passing through the centre, and reflections in any plane 
passing through the centre; this group, which we shall denote by K,, is 
the symmetry group of a single atom. It contains as a sub-group the group 
K of all spatial rotations (called the three-dimensional rotation group, or simply 
the rotation group).. The group K, can be obtained from the group K by 
adding a centre of symmetry (K, = K xC,). 

The elements of a continuous point group may be distinguished by one 
or more parameters which take a continuous range of values. Thus, in the 
rotation group, the parameters might be the three Eulerian angles, which 
define a rotation of the coordinates. 


The general properties of finite groups described in §92, and the concepts 
appertaining to them (sub-groups, conjugate elements, classes, etc.), can be 
at once generalized to continuous groups. Of course, the statements which 
directly concern the order of the group (for instance, that the order of a sub- 
group divides the order of the group) are no longer meaningful. 

In the group Ca, all planes of symmetry are equivalent, so that all reflec- 
tions o, form a single class with a continuous series of elements; the axis of 
symmetry is bilateral, so that there is a continuous series of classes, each 
containing two elements C(+¢). The classes of the group D „p are obtained 
at once from those of the group Cov since Don = C,,,C;. 

In the rotation group K, all axes are equivalent and bilateral; hence the 
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classes of this group are rotations through an angle of fixed absolute magnitude 
iġ| about any axis. The classes of the group K, are obtained at once from 
those of the group K. 

The concept of representations, reducible and irreducible, can also be 
immediately generalized to continuous groups. Each irreducible representa- 
tion contains an infinite sequence of matrices, but the number of base 
functions transformed into combinations of one another (the dimension of 
the representation) is finite. These functions may always be chosen so as to 
make the representation unitary. The number of different irreducible 
representations of a continuous group 1s infinite, but they form a discrete 
sequence, i.e. they can be numbered successively. For the matrix elements 
and characters of these representations there are orthogonality relations which 
generalize the corresponding ones for finite groups. Instead of (94.9), we 
now have 


| GixGim*ds — 7dasbubim | dz «¢, (98.1) 


and instead of (94.10) 
| x(G)x(G)* dre = Bip | dre. (98.2) 


The integration in these formulae is what is called an invariant integration 
over the group; the element drg is expressed in terms of the parameters of 
the group and their differentials in such a way as to remain an element when 
subjected to any transformation in the group.f For example, in the rotation 
group we can take drg = sin £ da df dy, where a, $ and y are the Eulerian 
angles, which define a rotation of the system of coordinates (§58); in this case, 
(dz7g= 872. 

We have already found, in essence, the irreducible representations of the 
three-dimensional rotation group (without using the terminology of group 
theory), when determining the eigenvalues and eigenfunctions of the total 
angular momentum. For the angular momentum component operators are 
(apart from a constant factor) the operators of infinitely small rotations,] 
and the eigenvalues of the angular momentum characterize the behaviour of 
the wave functions with respect to spatial rotations. To a value j of the 
angular momentum there correspond 2j+ 1 different eigenfunctions pjm, 
differing in the values of the component m of the angular momentum and 
all belonging to one (2j + 1)-fold degenerate energy level. Under rotations 
of the coordinate system, these functions are transformed into linear combi- 
nations of themselves, and thus give irreducible representations of the 


+ The statements made here about the properties of irreducible representations of con- 
tinuous groups are valid only if the integrals (98.1) and (98.2) converge; in particular, the 
“volume of the group” | dre must be finite. This condition 1s satished for continuous point 
groups (but not, for instance, for the Lorentz group which occurs in the relativistic theory). 

t In mathematical terms, these operators are the generators of the rotation group. 
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rotation group. Thus, from the group-theory point of view, the numbers j 
number the irreducible representations of the rotation group, and one 
(2j + 1)-dimensional representation corresponds to each j. The number ; 
takes integral and half-integral values, so that the dimension 2;+1 of the 
representations takes all the integral values 1, 2, 3, ... . 

The base functions of these representations have been, in essence, 
investigated in §§56 and 57, and the matrices of the representations have 
been found in $58. The basis of a representation of given j is formed by 
the 27+ 1 independent components of a symmetrical spinor of rank 27 (which 
are equivalent to the set of 27+ 1 functions yn). 

The irreducible representations of the rotation group which correspond 
to half-integral values of 7 are distinguished by an important property. 
Under a rotation through 27, the base functions of the representations 
change sign (being components of a spinor of odd rank). Since, however, 
a rotation through 27 is the same as the unit element of the group, we 
reach the result that representations with half-integral 7 are, as we say, 
two-valued; to each element of the group (a rotation through an angle 
¢,0 < $ < 2r, about some axis) there correspond in such a representation 
not one but two matrices, with characters differing in sign.f 

An isolated atom has, as we have already remarked, the symmetry K, 
= KxC,. Hence, from the group-theory point of view, there corresponds 
to each term of the atom some irreducible representation of the rotation 
group K (determining the value of the total angular momentum J of the atom) 
and an irreducible representation of the group C; (determining the parity 
of the state).f 

When the atom is placed in an external electric field, its energy levels are 
split. The number of different levels resulting and the symmetry of the 
Corresponding states can be determined by the method described in §96. 
It is necessary to decompose the (2/+ 1)-dimensional representation of the 
symmetry group of the external field (given by the functions ¢yaz) into 
irreducible representations of this group. This requires a knowledge of the 
characters of the representation given by the functions Wym. 

Since the characters of the irreducible representations of elements of one 
class are the same, it is sufficient to consider rotations about the z-axis. 
By a rotation through an angle ¢ about this axis the wave functions yy. 


t It must be mentioned that two-valued representations of a group are not representations 
in the true sense of the word, since they are not given by one-valued base functions; see 
also §99. 

J Moreover, the Hamiltonian of the atom is invariant with respect to interchanges of the 
electrons. In the non-relativistic approximation, the coordinate and spin wave functions are 
separable, and we can speak of representations of the permutation group that are given by the 
coordinate functions. If the irreducible representation of the permutation group is given, the 
total spin S of the atom is determined (§63). When the relativistic interactions are taken into 
account, however, the separation of the wave functions into coordinate and spin parts is not 
possible. The symmetry with respect to simultaneous interchange of the coordinates and 
spins of the particles does not characterize the term, since Pauli’s principle admits only those 
total wave functions which are antisymmetric with respect to all the electrons. This is in 
accordance with the fact that, when the relativistic interactions are taken into account, the 
spin is not, strictly speaking, conserved; only the total angular momentum J is maven. 
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are, as we know, multiplied by eè¥$%, where M is the component of the angular 
momentum along this axis. The transformation matrix for the functions 
Wya will therefore be diagonal, with character 


J eJ +o — ¢-ié 
e pe iMó — i 
x (d) > ee 
ort 
in( J+ 
pee (98.3) 
sintd 


With respect to inversion J, all the functions yy m with different M behave 
in the same way, being multiplied by +1 or — 1 according as the state of the 
atom is even or odd. Hence the character 


xA = +(2F+4+1). (98.4) 


Finally, the characters corresponding to reflection in a plane o and rotary 
reflection through an angle ¢ are found by writing these symmetry trans- 
formations as 


S216,  S($)= 1c). 


Let us pause to consider also the irreducible representations of the 
axial symmetry group Cer- This problem has, in essence, been solved when 
we ascertained the classification of the electron terms of a diatomic molecule 
having this symmetry C,,, (ie. when the two atoms are different). To the 
terms 0+ and 0- (with Q = 0) there correspond two one-dimensional 
irreducible representations: the unit representation A; and the representation 
A», in which the base function is invariant under all rotations and changes 
sign under reflections in planes c», while to the doubly degenerate terms with 
Q = 1, 2,... there correspond two-dimensional representations denoted by 
E, Eo, .... Under a rotation through an angle ¢ about the axis, the base 
functions are multiplied by e+*, while on reflection in planes o, they change 
into each other. The characters of these representations are 


Cw E 2C(¢) COdy 
Ay 1 1 1 
Az 1 1 a Oa) 
Ey 2 _2coskd 0 


+ To avoid misunderstanding, it should be emphasized that this formula corresponds to a 
parametrization of the group elements other than that by the Eulerian angles: the trans- 
formation is specified by the direction of the axis of rotation and the angle ¢ of the rotation 
about the axis. It can be shown that, with this parametrization, the integration in (98.2), for 
example, is to be taken over 2(1—cos ¢)d¢ do, where do is the element of solid angle for the 
direction of the axis of rotation. 
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The irreducible representations of the group Der = C,.,XC, are ob- 
tained at once from those of the group Cap (and correspond to the classifi- 
cation of the terms of a diatomic molecule composed of like nuclei). 

If we take half-integral values for Q, the functions e+'°@¢ give two-valued 
irreducible representations of the group C,,,, corresponding to the terms of 
the molecule having half-integral spin. 


§99. Two-valued representations of finite point groups 


To the states of a system with half-integral spin (and therefore half- 
integral total angular momentum) there correspond two-valued represen- 
tations of the point symmetry group of the system. This is a general property 
of spinors, and therefore holds for both continuous and finite point groups. 
The necessity thus arises of finding the two-valued irreducible representa- 
tions of finite point groups. 

As we have already remarked, the two-valued representations are not 
really true representations of a group. In particular, the relations discussed 
in §94 do not apply to them, and where all irreducible representations were 
considered in these relations (for example, in the relation (94.17) for the sum 
of the squared dimensions of the irreducible representations), only the true 
one-valued representations were meant. 

To find the two-valued representations, it is convenient to employ the 
following artifice (H. A. Bethe 1929). We introduce, in a purely formal 
manner, the concept of a new element of the group (denoted by Q); this is a 
rotation through an angle of 27 about an arbitrary axis, and is not the unit 
element, but gives the latter when applied twice: Ọ? = E. Accordingly, 
rotations C, about the axes of symmetry of the nth order will give identical 
transformations only after being applied 2” times (and not 7 times): 


C,” =Q, C,” = E. (99.1) 


The inversion J, being an element which commutes with all rotations, 


must give E as before on being applied twice. A twofold refiection in a plane, 
however, gives QO, not E: 


Ca @), ot = E; (99.2) 


this follows, since the reflection can be written in the form o, = IC, Asa 


eee ane 


t Contrary to the result for the three-dimensional rotation group, it would here be possible 
by a suitable choice of fractional values of Q, to obtain not only one-valued and oadd 
representations, but also those of three or more values. However, the physically possible 
eigenvalues of the angular momentum, which is the operator of an infinitely small rotation 
are determined by the representations of the aforementioned three-dimensional rotation 
group. Hence the three (or more)-valued representations of the two-dimensional Aion 


group (and of any finite symmetry group), though mathematically determinate, are without 
physical significance. á 
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result we obtain a set of elements forming some fictitious point symmetry 
group, whose order is twice that of the original group; such groups we shall 
call double point groups. The two-valued representations of the actual point 
group will clearly be one-valued (i.e. true) representations of the correspond- 
ing double group, so that they can be found by the usual methods. 

The number of classes in the double group is greater than in the original 
group (but not, in general, twice as great). The element Q commutes with all 
the other elements of the group,t and hence always forms a class by itself. If 
the axis of symmetry is bilateral, the elements C,,* and C,2"-* = OC,"~* are 
conjugate in the double group. Hence, when axes of the second order are 
present, the distribution of the elements among classes depends also on 
whether these axes are bilateral (in ordinary point groups this is unimportant, 
since C, is the same as the opposite rotation C,~). 

Thus, for instance, in the group T the axes of the second order are equiva- 
lent, and each of them is bilateral, while the axes of the third order are equiva- 
lent but not bilateral. Hence the 24 elements of the double group} T” are 
distributed in seven classes: E, Q, the class of three rotations C, and three 
C0, and the classes 4C,, 4C,”, 4C,Q, 4C,°Q. 

The irreducible representations of a double point group include, firstly, 
representations which are the same as the one-valued representations of the 
simple group (a unit matrix corresponding to both Q and E); secondly, 
the two-valued representations of the simple group, a negative unit matrix 
corresponding to Q. It is these latter representations in which we are now 
interested. 

The double groups C,,’ (n = 1, 2, 3, 4, 6) and S,’, like the corresponding 
simple groups, are cyclic.|| All their irreducible representations are one- 
dimensional, and can be found without difficulty as shown 1n §95. 

The irreducible representations of the groups D,’ (or C,,’, which are 
isomorphous with them) can be found by the same method as for the cor- 
responding simple groups. These representations are given by functions 
of the form e+**+, where ¢ is the angle of rotation about an axis of the nth 
order, and k is given half-integral values (the integral values correspond to 
the ordinary one-valued representations). Rotations about horizontal axes 
of the second order change these functions into one another, while the rota- 
tion C,, multiplies them by e+?7**/". 

It is a little less easy to find the representations of the double cubic groups. 
The 24 elements of the group T’ are.divided among seven classes. Hence 
there are altogether seven irreducible representations, of which four are 
the same as those of the simple group T. The sum of the squared dimensions 
of the remaining three representations must be 12, and hence we find that 
they are all two-dimensional. Since the elements C, and C0 belong to the 


+ This is obvious for rotations and inversion; for a reflection in a plane, it follows since the 
reflection can be represented as the product of an inversion and a rotation. 
t We distinguish the double groups by primes to the symbols for the ordinary groups. | 
The groups S2’ = Cr, Se’ = Cai’, however, which contain the inversion J, are Abelian 
but not cyclic. 
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TABLE 8 
Two-valued representations of point groups 


C,{2) Co) C,(2) 

















D; E Ọ QQ QMQ CQ 
E’ 2 —2 0 0 0 
Ca Ce 
Dy E Q CQO CQ 3U, 3U.0 
1 —] —1 1 I —i 
E, 
1 —1 —1 1 =i t 
Er 2 —2 1 —1 0 0 
Ç; C; C3? C, Ce’ 3U, Ser, 
Dé E 0 CQ C0 CQ CO CQ 3U,Q 3U’.Q 
Ee 2 —2 0 1 —1 /3 —vV/3 0 0 
Ey 2 —2 0 1 —1 —/3 /3 0 0 
Be 2 —2 0 —2 2 0 0 0 0 
C: C G2 obs 2er. 
DP; E Q QO C20 CxO 2U,0 2U’.Q 
EY 2 —2 0 /2 —/2 0 0 
E” 2 —2 0 —4/2 /2 0 0 
3C, 
Tr’ 5.) © 4C, 4C,? 4C,0 4C%Q 3C, 
E 2 —2 1 —1 —1 1 0 
2 —2 £ — ç? —eE g? 0 
G’ 
2 —2 <3 —e — ç? £ 0 


4C; 4C? LO 3C 4 3G 6C, 








0’ E Ọ 4C70 4C;0 3C,°0 3C8Q 3C.Q 6C.O 
E” 2 =? 1 —1 0 J2 cen? 0 
E; 2 —2 1 =i 0 a af 2 we 0 
ey 4 4 —1 1 0 0 0 0 


396 The Theory of Symmetry §99 


same class, y(C,) = x(C,Q) = —x(C,), whence we conclude that x(C,) = 0 
in all three representations. Next, at least one of the three representations 
must be real, since complex representations can occur only in conjugate 
pairs. Let us consider this representation, and suppose that the matrix of 
the element C, is brought to diagonal form, with diagonal elements a,, 4). 
Since C, = Q, a3 = a = —1. In order that x(C,) = a,+a, may be real, 
we must take a, = e"/3, a, = e-"*/3, Hence we find that (C,) = 1, x(C,) 
= a,?+a,2 = —1. Thus one of the required representations is obtained. By 
comparing its direct products with the two complex conjugate one-dimensional 
representations of the group T, we find the other two representations. 

By means of similar arguments, which we shall not pause to give here, we 
may find the representations of the group O’. Table 8 gives the characters 
of the representations of the double groups mentioned above. Only those 
representations are shown which correspond to two-valued representations 
of the ordinary groups. The tsomorphous double groups have the same 
representations. 

The remaining point groups are isomorphous with those we have con- 
sidered, or else are obtained by direct multiplication of the latter by the group 
C, so that their representations do not need to be specially calculated. 

For the same reasons as for ordinary representations, two complex con- 
jugate two-valued representations must be regarded as one physically 
irreducible representation of twice the dimension. It is necessary to pair 
one-dimensional two-valued representations even when they have real 
characters. For (see §60) in systems with half-integral spin, complex 
conjugate wave functions are linearly independent. Hence, if we have a 
two-valued on --dimensional representationt with real characters (given 
by some function 4), then, although the complex conjugate function 4” 1s 
transformed by an equivalent representation, we can nevertheless see that ¢ 
and y* are linearly independent. Since, on the other hand, the complex 
conjugate wave functions must belong to the same energy level, we see that 
in physical applications this representation must be doubled. 

The whole of the discussion in §97 concerning the method of finding the 
selection rules for the matrix elements of various physical quantities f 
remains valid for states of a system with half-integral spin, except as regards 
the matrix elements diagonal with respect to energy. On repeating the 
analysis at the end of §97 but with formulae (60.2) and (60.3), we find that, 
if the quantity f is even or odd under time reversal, we must use, in finding 
the selection rules, respectively the antisymmetric {D@?} and symmetric 
[D2] products of the representation D% with itself; this is the opposite 
of the rule stated in §97 for systems with integral spin.] 


+ Such representations are tound ın the group C,, tor odd n; the characters are yC) = 
(1y. 

t In connection with the application of these rules, it may be noted that for two-valued 
representations the unit representation is in the antisymmetric, not the symmetric, product 
of the representation with itself. For a two-valued representation with dimension 2, the 


product {D‘)*} is just the unit representation. 
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PROBLEM 
Determine how the levels of an atom (with given values of the total angular momentum J) 
are split when it is placed in a field having the cubic symmetryt O. 


SOLUTION. The wave functions of the states of an atom with angular momentum J and 
various values My give a (2 J +1)-dimensional reducible representation of the group O, with 
characters determined by the formula (98.3). Decomposing this representation into irreduc- 
ible parts (one-valued for integral J and two-valued for half-integral J), we at once find the 
required splitting (cf. §96). We shall list the irreducible parts of the representations corres- 
ponding to the first few values of J: 


j=? Ay 
1/2 H 
1 Fy 
3/2 G” 
2 E+F, 
5/2 E; +C 
3 A,+F,+F, 


t For example, an atom in a crystal lattice. The presence or absence of a centre of symmetry in 
the symmetry group of the external field is immaterial to this problem, since the behaviour of the 
wave function on inversion (the parity of the level) is unrelated to the angular momentum J. 


CHAPTER XIII 
POLY ATOMIC MOLECULES 


§100. The classification of molecular vibrations 


IN its applications to polyatomic molecules, group theory first of all resolves 
the problem of the classification of their electron terms, i.e. of the energy 
levels for a given situation of the nuclei. They are classified according to 
the irreducible representations of the point symmetry group appropriate to 
the configuration of the nuclei. Here, however, we must emphasize what 
is really obvious, that the classification thus obtained belongs to the definite 
nuclear configuration considered, since the symmetry is in general destroyed 
when the nuclei are displaced. We usually discuss the configuration cor- 
responding to the equilibrium position of the nuclei. In this case the classi- 
fication continues to possess a certain amount of meaning even when the 
nuclei execute small vibrations, but of course becomes meaningless when 
the vibrations can no longer be regarded as small. 

In the diatomic molecule this question did not arise, since its axial sym- 
metry is of course preserved under any displacement of the nuclei. A 
similar situation occurs for triatomic molecules also. The three nuclei 
always lie in a plane, which is a plane of symmetry of the molecule. Hence 
the classification of the electron terms of the triatomic molecule with respect 
to this plane (wave functions symmetric or antisymmetric with respect to 
reflection in the plane) is always possible. 

For the normal electron terms of polyatomic molecules there is an empirical 
rule according to which, in the great majority of molecules, the wave function 
of the normal electron state is completely symmetrical (this rule, for diatomic 
molecules, has already been mentioned in §78). Thus, the wave function 
is invariant with respect to all the elements of the symmetry group of the 
molecule, i.e. it belongs to the unit irreducible representation of the group. 

The application of the methods of group theory is particularly significant 
in the investigation of molecular vibrations (E. P. Wigner 1930). Before 
beginning a quantum-mechanical investigation of this problem, a purely 
classical discussion of the vibrations of the molecule is necessary, in which 
it is regarded as a system of several interacting particles (the nuclei). 

A system of N particles (not lying in a straight line) has 3V—6 vibrational 
degrees of freedom; of the total number of degrees of freedom 3N, three 
correspond to translational and three to rotational motion of the system as 
a whole (see Mechanics, §§23, 24).t The energy of a system of particles 


+ If all the particles lie in a straight line, the number of vibrational degrees of freedorn is 
3N—S. in this case, only two coordinates correspond to rotation, since it is meaningless to 
7 + ; à 
speak of the rotation of a linear molecule about its axis. 
20Q 
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executing small vibrations can be written 


E = $ X miith E kisti, (100.1) 


where m,,, kip are constant coefficients, and the u; are the components of 
the vector displacements of the particles from their equilibrium positions 
(the suffixes i, R denumerate both the components of the vector and the 
particles). By a suitable linear transformation of the quantities u,, we can 
eliminate from (100.1) the coordinates corresponding to translational motion 
and rotation of the system, and take the vibrational coordinates in such a 
way that both the quadratic forms in (100.1) are transformed into sums of 
squares. Normalizing these coordinates so as to make all the coefficients 
in the expression for the kinetic energy unity, we obtain the vibrational energy 
in the form 


E = $2 Oy?-+4 Z Wy" z Qai”. (100.2) 


The vibrational coordinates Q,4 are said to be normal; the w, are the fre- 
quencies of the corresponding independent vibrations. It may happen that 
the same frequency (which is then said to be multiple) corresponds to several 
normal coordinates; the suffix « to the normal coordinate gives the number 
of the frequency, and the suffix 7 = 1, 2, ..., fa numbers the coordinates 
belonging to a given frequency (f, being the multiplicity of the frequency). 

The expression (100.2) for the energy of the molecule must be invariant 
with respect to symmetry transformations. This means that, under any 
transformation belonging to the point symmetry group of the molecule, the 
normal coordinates Qai i = 1, 2, ..., fa (for any given a) are transformed 
into linear combinations of themselves, in such a way that the sum of the 
squares 2 O12 remains unchanged. In other words, the normal coordinates 


belonging to any particular eigenfrequency of the vibrations of the molecule 
give some irreducible representation of its symmetry group; the multiplicity 
of the frequency determines the dimension of the representation. The 
irreducibility follows from the same considerations as were given in §96 for 
the solutions of Schrédinger’s equation. The equality of the frequencies 
corresponding to two different irreducible representations would be an 
improbable coincidence. A reservation is again necessary: since the normal 
coordinates are by their physical nature real quantities, two complex 
conjugate representations correspond physically to one eigenfrequency of 
twice the multiplicity. 

_These considerations enable us to carry out a classification of the eigen- 
vibrations of a molecule without solving the complex problem of actually 
determining its normal coordinates. To do so, we must first find (by the 
method described below) the representation given by all the vibrational 
coordinates together, which we shall call the total vibrational representation; 
this representation ts reducible, and on decomposing it into irreducible parts 
we determine the multiplicities of the eigenfrequencies and the symmetry 
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properties of the corresponding vibrations. Here it may happen that the 
same irreducible representation appears several times in the total represen- 
tation; this means that there are several different frequencies of the same 
multiplicity and with vibrations of the same symmetry. 

To find the total vibrational representation, we start from the fact that the 
characters of a representation are invariant with respect to a linear transforma- 
tion of the base functions. Hence they can be calculated by using as base 
functions not the normal coordinates, but simply the components u; of the 
vectors of the displacements of the nuclei from their equilibrium positions. 

First of all, it is evident that, to calculate the character of some element 
G of a point group, we need consider only those nuclei which (or, more exactly, 
whose equilibrium positions) remain fixed under the given symmetry trans- 
formation. For if, under the rotation or reflection G in question, nucleus 1 
is moved to a new position, previously occupied by a similar nucleus 2, 
this means that under the operation G a displacement of nucleus’! is trans- 
formed into a displacement of nucleus 2. In other words, there will be no 
diagonal elements in the rows of the matrix G,, which correspond to this 
nucleus (i.e. to its displacement u). The components of the displacement 
vector of a nucleus whose equilibrium position is not affected by the operation 
G, on the other hand, are evidently transformed into combinations of them- 
selves, so that they may be considered independently of the displacement 
vectors of the remaining nuclei. 

Let us first consider a rotation C(¢) through an angle ¢ about some sym- 
metry axis. Let uz, uy, U be the components of the displacement vector 
of some nucleus, whose equilibrium position is on the axis, and hence is 
unaffected by the rotation. Wader the rotation these components are 
transformed, like those of any ordinary (polar) vector, according to the for- 
mulae (the z-axis being the axis of symmetry) 


u', =u, cosd+u, sing, 


u’, = —u,sin¢d+u, Cos, 


The character, i.e. the sum of the diagonal terms of the transformation matrix, 
is 14+2cos¢. If altogether Ng nuclei lie on the axis in question, the total 
character is 


No(1+2 cos¢). (100.3) 


However, this character corresponds to the transformation of all the 3N 
displacements u,; hence it is necessary to separate the part corresponding to 
the transformations of translation and (small) rotation of the molecule as a 
whole. The translation is determined by the displacement vector U of the 


centre of mass of the molecule; the corresponding part of the character iS 
therefore 1+2cos¢. The rotation of the molecule as a whole is determined 
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by the vector 6&2 of the angle of rotation.t The vector 6&2 is axial, but with 
respect to rotations of the coordinate system an axial vector behaves like 
a polar vector. Hence a character of 1 +2 cos ¢ also corresponds to the vector 
582. Altogether, therefore, we must subtract from (100.3) a quantity 
2(1 +2 cos¢). Thus we finally have the character y(C) of the rotation C(¢) 


in the total vibrational representation: 
y(C) = (Nc—2)(1+2 cos¢). (100.4) 


The character of the unit element is evidently just the total number of 
vibrational degrees of freedom: y(E) = 3N —6 (as is obtained from (100.4) 
when Nc = dV, d = 0). 

In a similar manner, we calculate the character of the rotary-reflection 
transformation S(¢) (a rotation through an angle ¢ about the z-axis and a 
reflection in the xy-plane). Here a vector is transformed according to the 
formulae 


ui, =u, cosd+u, sind, 
uy = —u,81Ind+u, cose, 
P = —h,, 


to which there corresponds a character —1 +2 cosġ. Hence the character 
of the representation given by all the 3N displacements u, is 


Ns(—1+2 cos), (100.5) 


where NV, is the number of nuclei left unmoved by the operation S(¢); this 
number is evidently either none or one. To the vector U of the displacement 
of the centre of mass there corresponds a character —1 +2 cosġ. The vector 
6&2 being an axial vector, is unchanged by an inversion of the coordinate 
system; on the other hand, the rotary—reflection transformation S(¢) can be 
represented in the form 


S(¢) = C(p)on = C(P)Cpl = Cr+), 


i.e. as a rotation through an angle 7+, followed by an inversion. Hence 
the character of the transformation S(¢) applied to the vector 8Q is equal 
to the character ‘of the transformation C(#+¢) applied to an ordinary vector, 
i.e. it is 1 +2 cos (7+¢) = 1—2 cos¢. The sum (—1+2 cos¢)+(1—2 cos ¢) 
= 0, so that we reach the conclusion that the expression (100.5) is equal to 


the required character x(S) of the rotary—reflection transformation S(¢) in 
the total vibrational representation: 


x(S) = Ns(—1+2 cos ¢). (100.6) 


In particular, the character of reflection in a plane (¢ = 0) is x(o) = N,, 
while that of an inversion ($ = 7) is x(I) = —3N r 


+ Asis well known, the angle of a small rotation can be regarded as a vector 8Q, whose modulus 
is equal to the angle of rotation and which is directed along the axis of rotation in the direction 
determined by the corkscrew rule. The vector 5Q so defined is clearly axial. 
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Having thus determined the characters y of the total vibrational representa- 
tion, we have only to decompose it into irreducible representations, which 1s 
done by formula (94.16) and the character tables given in §95 (see the Prob- 
lems at the end of the present section). 

To classify the vibrations of a linear molecule there is no need to have 
recourse to group theory. The total number of vibrational degrees of freedom 
is 3N—5. Among the vibrations, we must distinguish those in which the 
atoms remain in a straight line, and those where this does not happen.t The 
number of degrees of freedom in the motion of N particles in a straight line 
is N; of these, one corresponds to the translational motion of the molecule 
as a whole. Hence the number of normal coordinates of the vibrations 
which leave the atoms in a straight line is N—1; in general, N—1 different 
eigenfrequencies correspond to them. The remaining (3N—5)—(N—1) 
= 2N—4 normal coordinates relate to vibrations which destroy the col- 
linearity of the molecule; to these, there correspond N —2 different double 
frequencies (two normal coordinates, corresponding to the same vibrations 
in two mutually perpendicular planes, belong to each frequency).J] 


PROBLEMS 


Pros_em 1. Classify the normal vibrations of the molecule NH, (an equilateral triangular 
pyramid, with the N atom at the vertex and the H atoms at the corners of the base; Fig. 41). 





Fic. 41 


SOLUTION. ‘The point symmetry group of the molecule is Cse. Rotations about an axis 
of the third order leave only one atom (N) fixed, while reflections in planes each leave two 
atoms fixed (N and one H). From formulae (100.4), (100.6) we find the characters of the 
total vibrational representation: 

E 2C; 30, 


6 0 a 





Decomposing this representation into irreducible parts, we find that it contains the repre- 
sentations A, and E twice each. ‘Thus there are two simple frequencies corresponding to 
vibrations of the type 4,, which conserve the complete symmetry of the molecule (what are 
called totally symmetric vibrations), and two double frequencies with corresponding normal 
coordinates which are transformed into combinations of each other by the representation E. 


PROBLEM 2. The same as Problem 1, but for the molecule H,O (Fig. 42). 


SOLUTION. The symmetry group is Cze. The transformation C, leaves the O atom fixed; 
the transformation ce (a reflection in the plane of the molecule) leaves all three atoms fixed ; 


+ If the molecule is symmetrical about its centre, a further characteristic of the vibrations 
appears; see Problem 10 at the end of this section. 

{ Using the notation for the irreducible representations of the group Cor (see §98), we can say 
that there are N — 1 vibrations of the type 41, and N—2 of the type E£}. 
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the reflection o’, leaves only the O atom fixed. The characters of the total vibrational repre- 
sentation are 


lad 
pd 
ta} 


This representation divides into the irreducible representations 2A,, 1B, i.e. there are two 
totally symmetric vibrations and one with the symmetry given by the representation B,; all 
the frequencies are simple. Fig. 42 shows the corresponding normal vibrations. 


PROBLEM 3. The same as Problem 1, but for the molecule CHCI, (Fig. 43a). 


SOLUTION. The symmetry group of the molecule is C,,. By the same method we find 
that there are three totally symmetric vibrations A, and three double vibrations of the type E£. 


PROBLEM 4. The same as Problem 1, but for the molecule CH, (the C atom is at the centre 
of a tetrahedron with the H atoms at the vertices; Fig. 43b). 


SOLUTION. The symmetry of the moletule is T}. The vibrations are 14,, 1E, 2F,. 


PROBLEM 5. ‘The same as Problem 1, but for the molecule C,H, (Fig. 43c). 


SOLUTION. The symmetry of the molecule is Dep. The vibrations are 2Aig, 1Asy, Lagu, 
1Byg, 1Byu, 1By¢, 3Byu, lE, 3E iu, 4E x9, 2E yu. 


PROBLEM 6. ‘The same as Problem 1, but for the molecule OsF, (the Os atom is at the 
centre of a cube with the F atoms.at the vertices; Fig. 43d). 

SOLUTION. ‘The symmetry of the molecule is O,. The vibrations are 14,¢, 1 1E 
1Eu, 2Fiu, 2F ro, 1Fyu. 1g) Agu, Gy 

PROBLEM 7. ‘The same as Problem 1, but for the molecule UF the U atom i 
of an octahedron with the F atoms at the vertices; Fig. 43e). - O lating 


SOLUTION. The symmetry of the molecule is Or. The vibrations are 1Aig, 1E 9, 2Fiu 
1 Foe, 1 Fy. 2 » 1t» 
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PROBLEM 8. ‘The same as Problem 1, but for the molecule C,H, (Fig. 43f). 


SOLUTION. The symmetry of the molecule is Dza. ‘The vibrations are 3Aig, 1Ayu, 2Asu, 
3E 9, 3Eu. 


PROBLEM 9. The same as Problem 1, but for the molecule C,H, (Fig. 43g; all the atoms 
are coplanar). 


SOLUTION. ‘The symmetry of the molecule is Dz}. The vibrations are 3Aig, 1Aiu, 2Bi¢, 
1Biz, 2Bsu, 1Bag, 2Bzu; the axes of coordinates are taken as shown in the figure. 


ProBLEM 10. The same as Problem 1, but for a linear molecule of N atoms symmetrical 
about its centre. 


SoLUTION. ‘To the classification of the vibrations of a linear molecule considered in the 
text, we must add the classification from the behaviour with respect to inversion in the centre. 
There are two distinct cases, according as N is even or odd. 

If N is even (N = 2p), there is no atom at the centre of the molecule. On giving to the 
p atoms in one half of the molecule independent displacements along the line, and to the re- 
maining p atoms equal and opposite displacements, we find that p of the vibrations leaving 
the atoms in line are symmetrical with respect to the centre, while the remaining (2p—1) —pP = 
p—1 vibrations of this type are antisymmetrica]. Next, p atoms have 2p degrees of freedom 
for motions in which the atoms do not remain inline. On giving equal and opposite displace- 
ments to symmetrically placed atoms, we should obtain 2p symmetrical vibrations; of these, 
however, the two corresponding to a rotation of the molecule must be removed. Thus there 
are p —1 double frequencies of vibrations which bring the atoms out of line and are symmetri- 


cal about the centre, and the same number [(2—2) —(p—1) = p—1] which are antisym- 
metrical, Using the notation for the irreducible representations of the group Don (see 
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the end of §98), we can say that there are p vibrations of the type Ag and p—1 of the types 
Ayu, Eig, Eiu. 

If Nis odd (N = 2p +1), similar arguments show that there are p vibrations of each of the 
types Aip, Au, Eu and p-—l of the type Eig. 


§101. Vibrational energy levels 


From the viewpoint of quantum mechanics, the vibrational energy of a 
molecule is determined by the eigenvalues of the Hamiltonian 


fa 
HO =} D E (Pa? +w Qai’), (101.1) 


where Ê ; = —ih0/@Q,; are the momentum operators corresponding to the 
normal coordinates Q.;. Since this Hamiltonian falls into the sum of in- 


~ 


dependent terms (P,;2 + w."Q,;7), the energy levels are given by the sums 


EY = AE wg È (get) = E holta), (101.2) 


where v, = È v,» and f is the multiplicity of the frequency w,. The wave 
functions are given by the products of the corresponding wave functions for 
linear harmonic oscillators: 


4 = Hya, (101.3) 
where #, = constant x exp{—4c,? -= Qui”) He), (101.4) 


where H, denotes the Hermite polynomial of order v, and ¢, = V/(w,/h). 
If there are multiple frequencies among the w,, the vibrational energy levels 
are in general degenerate. The energy (101.2) depends only on the sums 
Uy = Ev, Hence the degree of degeneracy of, the level is equal to the 
number of ways of forming the given set of numbers v, from the v; For a 
single number v, 1t 1st 

(v,-+f,—1)!/2,!(f,—D)!- 


Hence the total degree of degeneracy is 


(t+-f,—1)! 
Į] E 8 (101.5) 


For double frequencies, the factors in this product are v, +1, while for triple 
frequencies they are (v, +1)(v, + 2). 

It must be borne in mind that this degeneracy occurs only so long as we 
consider purely harmonic vibrations. When terms of higher order in the 
normal coordinates are taken into account in the Hamiltonian (anharmonic 


vibrations), the degeneracy is in general removed, though not completely 
(see §104 for a further discussion of this point). 


t This is the number of ways in which vą balls can be distributed among f, urns. 
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The wave functions (101.3) belonging to the same degenerate vibrational 
term give some representation (in general reducible) of the symmetry group 
of the molecule. The functions belonging to different frequencies are 
transformed independently of one another. Hence the representation given 
by all the functions (101.3) is the product of the representations given by the 
functions (101.4), so that we need consider only the latter. 

The exponential factor in (101.4) is invariant with respect to all the 
symmetry transformations. In the Hermite polynomials, the terms of any 
given degree are transformed only into similar terms; a symmetry trans- 
formation evidently does not change the degree of any term. Since, on the 
other hand, each Hermite polynomial is completely determined by its highest 
term, it follows that it is sufficient to consider only the highest term, writing 


a 
ah Es, (cae) = Constant x Og Quen ene Qos Sa 


+ terms of lower degree. 


The functions for which the sum v, = È v, has the same value belong 
to the same term. Thus we have a representation given by the products of 
v, quantities Q, ;; this is just the symmetric product (see §94) of the irreducible 
representation given by the Q,; with itself v, times (L. Tisza 1933). 

For one-dimensional representations, the finding of the characters of their 
symmetric products with themselves v times is trivial :T 


x(G) = [x(G)] P, 


For two- and three-dimensional representations it is convenient to use the 
following mathematical device.f The sum of the squared base functions of 
an irreducible representation is invariant with respect to all symmetry 
transformations. Hence we can formally regard these functions as the com- 
ponents of a vector in two or three dimensions, and the symmetry transfor- 
mations as some rotations (or reflections) applied to these vectors. We 
emphasize that there is in general no relation between these rotations and 
reflections and the actual symmetry transformations, the former depending 
(for any given element G of the group) also on the particular representation 
considered. 

Let us consider two-dimensional representations more closely. Let x(G) 
be the character of some element of the group in the two-dimensional repre- 
sentation concerned, with x(G) # 0. The sum of the diagonal elements of 
the transformation matrix for the components x, y of a two-dimensional 
vector on rotation through an angle ¢ in a plane is 2 cos. Putting 


2 cosġ = x(G), (101.6) 


we find the angle of the rotation which formally corresponds to the element 


+ We use the notation x(G) in place of the cumbersome [x"] (G). 
t It was applied to this problem by A. S. Kompaneets (1940). 
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G in the irreducible representation considered. The symmetric product of 
the representation with itself v times is the representation whose basis is 
formed by the v+1 quantities x", x*"ly, ..., y". The characters of this 
representation aret 


XG) = sin(v+1)¢/sin¢. (101.7) 


The case where x(G) = 0 requires special consideration, since a zero charac- 
ter corresponds both to a rotation through r and to a reflection. If 
x(G?) = —2, we have a rotation through 47, and for y,(G) we obtain 


x(G) = —4(-1)”[1 +(-1)*]. (101.8) 


If x(G*) = 2, on the other hand, x(G) must be regarded as the character 
of a reflection (i.e. a transformation x —> x, y > —y); then 


x(G) = ${1+(—1)*]. (101.9) 


We can similarly obtain the formulae for the symmetric products of three- 
dimensional representations. The finding of the rotation or reflection which 
formally corresponds to an element of the group in a given representation is 
easily accomplished with the aid of Table 7 (§95). This is the transformation 
which corresponds to the given x(G) in that isomorphous group in which the 
coordinates are transformed by the representation in question. Thus, for 
the representation F, of the groups O and T , we must take a transformation 
from the group O, but for the representation F, we must take one from the 
group Ta We shall not pause here to derive the corresponding formulae for 
the characters y,(G). 


§102. Stability of symmetrical configurations of the molecule 


For a symmetrical position of the nuclei, an electron term of the molecule 
may be degenerate, if there are among the irreducible representations of the 
symmetry group one or more whose dimensions exceed unity. We may ask 
whether such a symmetrical configuration is a stable equilibrium configura- 
tion of the molecule. Here we shall neglect the effect of spin (if any), which 
is usually insignificant in polyatomic molecules. The degeneracy of the 
electron terms of which we shall speak is therefore only the orbital degeneracy, 
and is unrelated to the spin. 

If the configuration in question is stable, the energy of the molecule as a 
function of the distances between the nuclei must be a minimum for the given 
position of the nuclei. This means that the change in the energy due to a small 
displacement of the nuclei must contain no terms linear in the displacements. 


t For purposes of calculation it ıs convenient to take the base functions in the form 
(x+ty)", (x+14y)""'(x—iy), .., (x—iy)”; 


the matrix of the rotation is then diagonal, and the sum of the diagonal elements takes the form 


etre + glit—23¢6 Heee enite 
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Let A be the Hamiltonian of the electron state of the molecule, the 
distances between the nuclei being regarded as parameters. We denote by A, 
the value of this Hamiltonian for the symmetrical configuration considered. 
The quantities defining the small displacements of the nuclei can be taken 
as the normal vibrational coordinates Q,;. The expansion of H in powers 
of the Q; is of the form 

H=H o+ VoiQait, Z W oibr Qai Opt ee.. (102.1) 
The expansion coefficients V, W, ... are functions only of the coordinates of 
the electrons. Under a symmetry transformation, the quantities Q,; are 
transformed into combinations of one another, and the sums in (102.1) are 
changed into other sums of the same form. Hence we can formally regard 
the symmetry transformation as a transformation of the coefficients in these 
sums, the Q,, remaining unchanged. Here, in particular, the coefficients 
V., (for any given «) will be transformed by the same representation of the 
symmetry group as the corresponding coordinates Qai. This follows at 
once from the fact that, by virtue of the invariance of the Hamiltonian under 
all symmetry transformations, the group of terms of any given order in its 
expansion must be invariant also, and in particular the linear terms must be 
invariant. 

Let us consider some electron term E, which is degenerate in the sym- 
metrical configuration. A displacement of the nuclei which destroys the 
symmetry of the molecule generally results in a splitting of the term. The 
amount of the splitting is determined, as far as terms of the first order in 
the displacements of the nuclei, by the secular equation form.żd from the 
matrix elements of the linear term in the expansion (102.1), 


Koo = 2 Ox: { PV aito dq, (102.2) 


where Y, Yo are the wave functions of electron states belonging to the 
degenerate term in question (and are chosen to be real). The stability of the 
symmetrical configuration requires that the splitting linear in Q should be 
zero, i.e. all the roots of the secular equation must vanish identically. ‘This 
means that the matrix V,, must itself be zero. Here, of course, we must 
consider only those normal vibrations which destroy the symmetry of the 
molecule, i.e. we must omit the totally symmetric vibrations (which cor- 
respond to the unit representation of the group). 

Since the Q,, are arbitrary, the matrix elements (102.2) vanish only if 
all the integrals 


f Vaio dg (102.3) 





+ Strictly speaking, the quantities Vei must be transformed by the representation which is 
the complex conjugate of that by which the Qai are transformed. However, as we have 
already pointed out, if two complex conjugate representations are not the same, they must 
physically be considered together as one representation of twice the dimension. The above 
remark is therefore unimportant. 
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vanish. Let Dé! be the irreducible representation by which the electron 
wave functions ys, are transformed, and D, the same for the quantities V,,; 
as we have already remarked, the representations D, are those by which the 
corresponding normal coordinates Q,; are transformed. According to the 
results of §97, the integrals (102.3) will be non-zero if the product 
[D(e2] x D, contains the unit representation or, what is the same thing, 1f 
[D'e52] contains D,. Otherwise all the integrals vanish. 

Thus a symmetrical configuration is stable if the representation [D(e2] 
does not contain any (except the unit representation) of the irreducible 
representations D, which characterize the vibrations of the molecule. This 
condition is always satisfied for non-degenerate electron states, since the sym- 
metric product of a one-dimensional representation with itself is the unit 
representation. 

Let us consider, for instance, a molecule of the type CH,, in which one 
atom (C) is at the centre of a tetrahedron, with four atoms (H) at the vertices. 
This configuration has the symmetry Ta. The degenerate electron terms 
correspond to the representations E, F}, F; of this group. The molecule has 
one normal vibration A, (a totally symmetric vibration), one double vibration 
E, and two triple vibrations F, (see §100, Problem 4). The symmetric pro- 
ducts of the representations E, Fi, Fa with themselves are 


[E°] = 4,+8, [F7] = [F.")] = 4 HE+ F 


w 


We see that each of these contains at least one of the representations E, F, 
and hence the tetrahedral configuration considered is unstable when there 
are degenerate electron states. 

This result constitutes a general rule, the Jahn-Teller theorem (H. A. Jahn 
and E. Teller 1937): when there is a degenerate electron state, any symmetri- 
cal position of the nuclei (except when they are collinear) is unstable. As a 
result of this instability, the nuclei move in such a way that the symmetry of 
their configuration is destroyed, the degeneracy of the term being completely 
removed. In particular, we can say that the normal electron term of a 
symmetrical (non-linear) molecule can only be non-degenerate.t 

As we have just mentioned, the linear molecules alone form an exception. 
This 1s easily seen, without using group theory. A displacement of a nucleus 
whereby it moves off the axis of the molecule is an ordinary vector with ¢¿ 
and » components (the ¢-axis being along the axis of the molecule). We 
have seen in §87 that such vectors have matrix elements only for transitions 
in which the angular momentum A about the axis changes by unity. On the 
other hand, to a degenerate term of a linear molecule there correspond states 
with angular momenta A and —A about the axis (A > 1). A transition be- 
tween them changes the angular momentum by at least 2, and therefore the 

t The physical idea of destruction of symmetry in an electron state that is degenerate 
because of the same symmetry is due to L. D. Landau (1934). Jahn and Teller proved the 


theorem by considering all possible types of symmetrical configuration of the nuclei in the 
molecule and examining each one by the method given above. 
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matrix elements always vanish. Thus the linear position of the nuclei in the 
molecule may be stable, even if the electron state is degenerate. 

A constructive general proof of the theorem is based on the following 
consideration (E. Ruch 1957). The degeneracy of electron states due to the 
symmetry of the configuration of the nuclei can exist only in point symmetry 
groups of the molecule which include at least one rotation (Cn) or rotary- 
reflection (Sn) axis of order n > 2. Then the wave functions of mutually 
degenerate states (i.e. the base functions of the corresponding representation 
D¢el)) include at least one for which the electron density p = |4|? = $ is 
not invariant under rotations about this axis; the electric field due to the 
electrons, like the electron density, will not be symmetrical about the axis. 
In a (non-linear) molecule, there are equivalent nuclei, not on the axis, 
which move to one another’s positions in the rotation Cy, or Sn. Thus 
equivalent nuclei lie at points that are not equivalent as regards the electric 
field. But an equivalence of the equilibrium positions of charged particles 
in a field that is not required by the symmetry of the field is impossible, in 
the sense that it can only result from an unlikely coincidence. 

The systematic proof is a mathematical embodiment of this physical 
situation. We shall indicate the structure of the proof (E. Ruch and A. 
Schénhofer 1965). 

Let us consider (in a non-linear molecule) some nucleus a that 1s not at 
the “centre” of the molecule (i.e. not at the fixed point for the transformations 
in its symmetry group) and not on the principal axis of symmetry, if any.T 
Let H be the set of all symmetry transformations of the molecule that leave 
the nucleus a fixed; H is a sub-group of the total symmetry group G of the 
molecule, and may be one of the point groups Cj, C. Ca, Cas: Thetrans= 
formations in G that are not in H move the nucleus a to the positions of other 
equivalent nuclei a’, a’’, ... ; let s be the number of nuclei in this set. It 
is evident that the order of the sub-group H is g/s, where g is the order of 
the whole group G (i.e. s is the index of the sub-group H in the group G).] 

The number s is certainly at least 3, since for the existence of an irreducible 
representation D(e) with dimension exceeding unity it is necessary (as 
already mentioned) that there should be at least one axis of symmetry of 
higher than the second order, and the nucleus a is not on this axis, by the 
condition stated. 

The representation D'eD of the group G is in general reducible with respect 
to the group H, which has a lower symmetry. Let us suppose that its de- 
composition into irreducible representations of H includes a representation 
d(el) of dimension unity. This is given by an electron wave function %, one 
of the base functions of the representation D‘e). Since the dimension of 
d‘el) is unity, p = y? is invariant under all transformations in H, i.e. it gives 
a unit irreducible representation of this group. 


+ By the principal axis we mean (in symmetry groups other than cubic and icosahedron) 
the axis Cn or Sn with n> 2. 

t All the elements of the group G can be divided into s cosets H, G'H, G''H, ..., where G', 
G”, ..., are the elements that move the nucleus a to a’, a”, ..., 
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Such a unit representation of H can also be obtained by taking as the basis 
one of the displacements Qa of the atom a: the displacement along the 
position vector of the nucleus a from the centre of the molecule. 

By applying to this displacement all the operations in the group G, we 
obtain the basis of a representation Dg (in general reducible) of this group. 
Since every transformation in G that is not in H changes the displacement Qa 
into a displacement of one of the other s—1 equivalent nuclei a’, a’, ..., 
and the displacements of different nuclei are of course linearly independent, 
the dimension of Dg is s. The displacements Qa, Qa., ... which form the 
basis of Dg certainly cannot correspond to either pure translation or pure 
rotation of the molecule as a whole: if there are three or more equivalent 
nuclei, radial displacements of them cannot be combined to form such 
displacements of the molecule. 

In a similar way, we can find a representation D, of the group G by 
applying all its transformations to the function p = 2. The dimension of 
D, may be s, but may also be less, since there is no reason to suppose that all 
the s functions p, G'p, G" p, ... are linearly independent. We can, however, 
say that the representation D,, if not the same as Dg, will always be entirely 
contained in Dg.t Moreover, it is not the unit representation, since 2 is 
certainly not invariant with respect to the whole group G; only the sum of 
the squares of the base functions of the irreducible representation DeD, with 
dimension exceeding unity, is invariant. 

These properties of the representations Dg and D, give the required 
result immediately: Dg is part of the total vibrational representation, and D, 
is part of the representation [D(¢92], not containing the unit representation. 
The fact that Dg contains D, therefore means that [D(¢2] contains at least 
one of the non-unit vibrational representations D,, as was to be proved. 

In these arguments, however, it has also been assumed that the decom- 
position of De) into irreducible representations of the sub-group H includes 
a representation of dimension unity. This assumption is correct in the 
great majority of cases. For example, it is certainly correct if H = 
Ci, Cs, C2, Coy (since all irreducible representations of these groups have 
dimension unity). It is certainly correct also if H = Cn, Cnv with n > 2, 
provided that the dimension of D'eD is odd (since the groups Cn and Cy, have 
only irreducible representations of dimension 1 or 2). An examination of the 
character tables of the irreducible representations of point groups shows 
that an exception occurs for the two-dimensional representations of the 
cubic groups G = O, Ta, On, with respect to the sub-groups H = Cs, Cy». 

Let us take the particular case of G = O and H = C3; this affects only 





t The significance of this statement is as follows. Let one representation (dimension f ) of the 
sub-group H be given by different sets of base functions, and let one set, when all the transfor- 
mations in G are applied to it, vive a representation of G with dimension sf, where s is the index 
of the sub-group H in the group G. ‘Then we can say that the representation of the group G 
obtained by the same method from any other such set of functions is either the same as the first 


or entirely contained in it. A rigorous proof is given by E. Ruch and A. Schönhofer, Theoretica 
Chimica Acta 3, 291, 1965. 
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the naming of the representations. The two electron functions y1, We give 
the representation DD = E of the group O, and the representation 
d'eD — E of the sub-group C3; the representation of the latter given by the 
products 4:2, 42, yı ye, is [E27] = A+E. A similar representation of C3 
is obtained with the three components of the vectors of any displacement 
Oa of the nucleus a as basis. The representation D, of O is in this case 
[D(eb2] = A,+E, and does not contain the representation Fz corresponding 
to the vector of translation or rotation of the molecule as a whole; it contains 
both the unit and a non-unit representation. Hence the fact that D, is 
(for the same reasons as previously) contained in Dg (whose dimension here 
is 3s) proves that in this case also the molecule is unstable.t 

In accordance with the remark at the beginning of this section, the whole 
of the above discussion has regarded the degeneracy of the electron states 
as being purely orbital in origin. It may be mentioned, however, that the 
Jahn-Teller theorem remains valid even when the spin-orbit and spin-spin 
interactions are taken into account, the only difference being that in (non- 
linear) molecules with half-integral spin the Kramers double degeneracy 
does not cause instability, ın accordance with the general theorem proved in 
§60. The latter case corresponds to the two-dimensional two-valued 
irreducible representations of the double point groups. The absence of 
instability in this case may be seen by the following formal argument. To 
determine the selection rules for matrix elements (102.3) in the case of 
two-valued representations Dte), we must consider the antisymmetric 
products {D(e2} not the symmetric ones (see §99). But, for every two- 
valued irreducible representation with dimension 2, these products are the 
unit representation, t.e. they certainly do not contain representations 
corresponding to any vibrations of the molecule that are not totally 
symmetrical. 


§103. Quantization of the rotation of a top 


The investigation of the rotational levels of a polyatomic molecule is often 
hampered by the necessity of considering the rotation simultaneously with 
the vibrations. As a preliminary example, let us consider the rotation of a 
molecule as a solid body, i.e. with the atoms “rigidly fixed” (a top). 

Let £, n, ¢ be a system of coordinates with axes along the three principal 
axes of inertia of a top, and rotating with it. The corresponding Hamiltonian 
is obtained by replacing the components /,, J,, J; of the angular momentum 
of the rotation, in the classical expression for the energy, by the corresponding 
operators: 


9 Jè 1 I? 103.1 
Ĥ =4h Eaa) (103.1) 


where I4, Ip, Ic are the principal moments of inertia of the top. 


+ One further exceptional case comprises the four-dimensional representations of the 
icosahedron groups. This is treated similarly, and gives the same result. 
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The commutation rules for the operators Pa. J: of the angular momen- 
tum components in a rotating system of coordinates are not obvious, since 
the usual derivation of the commutation rules relates to the components 
l, js J z in a fixed system of coordinates. ‘They are, however, easily obtained 
by using the formula 


(J.a)(J.b)—(J.b)(J.a) = —iJ.axb, (103.2) 


where a, b are any two commuting vectors which characterize the body in 
question. This formula is easily verified by calculating the left-hand side 
of the equation in the fixed system of coordinates x, Y, z, using the general 
rules for the commutation of angular momentum components with one 
another and with the components of an arbitrary vector. 

Let a and b be unit vectors along the £ and ņ axes. Then axb is a unit 
vector along the ¢-axis, and (103.2) gives 


Jii-Jii = —if¢. (103.3) 


Two other relations are obtained similarly. Thus the commutation rules for 
the operators of the angular momentum components in the rotating system of 
coordinates differ from those in the fixed system only in the sign on the right- 
hand side of the equation.t Hence it follows that all the results which we 
have previously obtained from the commutation rules, relating to the 
eigenvalues and matrix elements, hold for Ja Jy Ji also, with the difference 
that all expressions must be replaced by their complex conjugates. In 
particular, the eigenvalues of J, (which will be denoted in this section by k, 
whereas the eigenvalues of Jz are denoted by M) take the values k = — 1, 
-~ +J, where J, an integer, is the magnitude of the angular momentum of 
the top. 


THE SPHERICAL TOP 


The finding of the eigenvalues of the energy of a rotating top is simplest 
for the case where all three principal moments of inertia of the body are 
equal: I4 = Ig = Ic= J. This holds for a molecule in cases where it has 
the symmetry of one of the cubic point groups. The Hamiltonian (103.1) 
takes the form 

H = hJ?/21, 
and its eigenvalues are 


E = h?J(J+1)/21. (103.4) 


Each of these energy levels is degenerate with respect to the 2J + 1 directions 


of the angular momentum relative to the body itself (i.e. with respect to the 
values of J, = k).{ 





t This expresses the fact that, as regards its effect on the wav 
rotation of the system x, y, z is equivalent to an opposite rotation of the system £, n, Č. 


This degeneracy always 


occurs, and ts not physically important. If it is included, the total degree of degeneracy of 


the energy levels of a spherical top is (27+ 1)?. 
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There is also no difficulty in calculating the energy levels in the case where 
only two of the moments of inertia of the top are the same: J4 = Ip # Ic. 
This holds for molecules having one axis of symmetry of order above the 
second. The Hamiltonian (103.1) takes the form 


A= jet j, ath} 2/2Ic 


= me Fi2tt ah (———) Ta (103.5) 
Hence we see that, in a state with given values of J and k, the energy is 
TS eat +1 yanne(——— Vee (103.6) 
21 4 Ic Ia 


which determines the energy levels of a symmetrical top. 

The degeneracy with respect to values of k which occurred for a spherical 
top is here partly removed. The values of the energy are the same only for 
values of k differing in sign alone, corresponding to opposite directions of 
the angular momentum relative to the axis of the top. Thus the energy 
levels of a symmetrical top are (for k # 0) doubly degenerate. 

The stationary states of a symmetrical top are thus described by three 
quantum numbers: the angular momentum / and its components along the 
axis of the top (J, = k) and along the z-axis fixed in space (Jz = M); the 
energy of the top is independent of the last of these. The simultaneous 
measurability of the angular momentum and its components along an axis 
fixed in space and along an axis rigidly attached to a physical systemt 
follows since the operators J? and J, commute not only with each other but 
also with the operator J, = jn, where n is a unit vector along the ¢-axis. 
This is easily shown by direct calculation, but 1s also obvious a priori: the 
angular momentum operator is that of an infinitesimal rotation, and the 
scalar product J.n of two vectors fixed to the top is invariant under any 
rotation of the coordinate axes. 

The determination of the wave functions of the stationary states of a 
symmetrical top therefore amounts to finding the common eigenfunctions 
of the operators J®, fz and Je This is in turn mathematically dependent on 
the law of transformation of the angular momentum eigenfunctions under 
finite rotations. With a change in the notation for the quantum numbers, we 
can write this law (58.7) as 


dou = È Dina, P, yhy (103.7) 


We shall take yyy to be the wave function of a state of the top described in 
terms of the fixed coordinates x, v, z, and the yx to be the wave functions of 
states described in terms of the axes £, 7, č attached tothe top. In coordinates 


+ Not to be confused with the components (not simultaneously measurable) along two axes 
fixed in space. 
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rigidly attached to a physical system (such as the top), the yx have definite 
values y% independent of the spatial orientation of the system. Formula 
(103.7) gives the angle dependence of the Wyar. Let the state JM) also 
have a definite value & of the angular-momentum component along the ¢-axis. 
This means that only the ¥ with that k will be non-zero. Then the sum 
in (103.7) reduces to one term: 


wom = PRD, É; y). 


This gives the dependence of the wave functions of the states IJMRkY on 
the Eulerian angles, which define the rotation of the axes of the top with 
respect to the fixed axes. Normalizing the wave function by the condition 


f lYyare|? sin £ da dB dy = 1, 


we have 
M PE 
damn = i |E DA B y); (103.8) 


the phase factor is chosen so that, for k = 0, the function (103.8) becomes the 
eigenfunction of the free (not attached to the Ç-axis) integral angular momen- 
tum J with component M, i.e. the ordinary (spherical harmonic) function; 


cf. (58.25).+ 


THE ASYMMETRICAL TOP 

For I4 # Ip # Ic, the calculation of the energy levels in a general form is 
impossible. The degeneracy with respect to the directions of the angular 
momentum relative to the top is here removed completely, so that 2 J+1 
different non-degenerate levels correspond to any given J. To calculate 
these levels (for a given J} we have to start from Schrédinger’s equation 
written in matrix form (O. Klein 1929). This is done as follows. 

The wave functions wz of states of the top with definite values of J and the 
¢-component of the angular momentum are the functions (103.8) derived 
above (we shall omit for brevity the suffix M giving the z-component of the 
angular momentum, since the energy does not depend on this); in these 
states the energy of the asymmetrical top does not have definite values. 
In the stationary states, on the other hand, the component J; does not have 
definite values, i.e. no definite values of k can be assigned to the energy levels. 
The wave functions of these states are sought as linear combinations 


br = È cebars (103.9) 
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it is assumed that all functions have some common value of M. Substitution 
in Schrédinger’s equation Hwy = Eyyy leads to the equations 


`, (CRIA RD — Eô; der = 0, (103.10) 
and the condition for these to have a solution leads to the secular equation 
| JRIA|JR'> — Erg] = 0. (103.11) 


The roots of this equation give the energy levels of the top, and then the 
equations (103.10) give the linear combinations (103.9) which diagonalize 
the Hamiltonian, i.e. the wave functions of the stationary states of the top 
with a given value of / (and of .17). The calculation of the matrix elements 
of any physical quantity with respect to these wave functions thus reduces 
to that of the matrix elements of the symmetrical top. 

The operators J ;, J „ have matrix elements only for transitions in which 
k changes by unity, while /, has only diagonal elements (see formulae (27.13), 
in which we must write J, k instead of L, M). Hence the operators / ;?, 
Ja Jè and therefore Ê, have matrix elements only for transitions with 
k +k or k42. The absence of matrix elements for transitions between 
States with even and odd & has the result that the secular equation of degree 
2] +1 immediately falls into two independent equations of degrees J and 
J +1. One of these contains matrix elements for transitions between states 
with even k, and the other contains those for transitions between states 
with odd k. Each of these equations, in turn, can be reduced to two equations 
of lower degree. To do this, we must use the matrix elLments defined, not 
with respect to the functions Wyp, but with respect to the functions 


wnt = (Yurt W- Yo = duo 


(103.12) 
PI = (urn —Wa—x) [4/2 (k + 0). 


Functions differing in the index + and — are of different symmetry (with 
respect to a reflection in a plane passing through the ¢-axis, which changes 
the sign of k), and hence the matrix elements for transitions between them 
vanish. Consequently we can form the secular equations separately for the 
+ and — states. 

The Hamiltonian (103.1), with the commutation rules (103.3), has a 
particular symmetry: it is invariant with respect to a simultaneous change in 
sign of any two of the operators J. J, Je This symmetry formally cor- 
responds to the group D2. Hence the levels of an asymmetrical top can be 
classified in accordance with the irreducible representations of this group. 
Thus there are four types of non-degenerate level, corresponding to the 
representations .4, Bi, B2, B3 (see Table 7, 89). 

It is easy to establish which states of the asymmetrical top belong to each 


of these types. To do so, we must find the symmetry properties of the 
yyy and the functions (103.12). This could be done directly from (103.8), 


§103 Quantization of the rotation of a top 417 


but it is simpler to begin from the more usual spherical harmonic functions, 
noting that, as regards their symmetry properties, the wave functions of 
states with definite values of the £-component of the angular momentum are 
the same as the angular-momentum eigenfunctions 


Wk ‘ad Yor*(8, $) ~ e**2O7,( 0), (103.13) 


where 6 and ¢ are the spherical angles in the axes £, n, ¢, and the symbol ~ 
stands for the words “‘is transformed as’’; the complex conjugate is taken in 
(103.13) because of the change in sign on the right of the commutation 
relations (103.3). 

A rotation through an angle v about the ¢-axis (i.e. the symmetry operation 
Cœ) multiplies the function (103.13) by (— 1)*; 


Co: hyp >(— Eyk. 


The operation C2 may be regarded as the result of successively per- 
forming an inversion and a reflection in the €f-plane; the first operation 
multiplies Yy by (—1)/, and the second (a change in sign of ¢) is equivalent 
to changing the sign of k. Using the definition (28.6) of the functions 
©u,—x, we therefore have 


Ca): Wark —>( =- Ga eee 


Finally, the operation Co® = Co™Co(6) gives 
Co: hap —>(— 1d —r. 


Using these transformation rules, we find that the states corresponding 
to the functions (103.12) belong to the following types of symmetry: 


J even, Reven A 

ye J even, kodd B} 
J odd, keven B, 

J odd, kodd Bo 

J even, Reven B; 

_| Jeven, Rodd Bə 
J odd, Reven A 


J odd, kodd B3 


(103.14) 


War 


By simple counting it is easy to find the number of states of each type for 
a given value of J. The following numbers of states correspond to the types 
A and each of Bı, Bo, Bs: 
A By, B2, B3 
J even 4J+1 4J 
ee (103.15) 
J odd tJ —4 J+} 
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For the asymmetrical top there are selection rules for the matrix elements 
of transitions between states of the types A, B,, Ba, B,; these rules are easily 
obtained from symmetry considerations in the usual way. Thus, for the 
components of a vector physical quantity A we have the selection rules 


for 4g: At Bf, Bie BM, 
fort Ac B®, Be BY, (103.16) 
for Ar 5 Aw B®, BH B®, 


For clarity we show, as an index to the symbol for the representation, the 
axis about which a rotation has the character +1 in the representation 
concerned. 


PROBLEMS 


PROBLEM 1. Find the wave functions of the states | JMk) of a symmetrical top by direct 
calculation as the eigenfunctions of the operators J*, Jz, J; (F. Reiche and H. Rademacher 
1926). 


SOLUTION. In order to obtain the wWyarx as functions of the Eulerian angles a, $, y, we 
must express in terms of them the operators of the angular momentum components along 
fixed axes x, y, z. Since the operator of such a component along any axis is—7¢/¢, where ¢ 
is the angle of rotation about this axis, we can write 


a 


Jr = —itlédr, Jy = —iledy, J, = - ijddz, 


where ¢z, $y. $z are the angles of rotation about the corresponding axes. The derivatives 
with respect to these angles can be expressed in terms of those with respect to g, B, y by 
noting that infinitesimal rotations are added like vectors alony the axes of rotation. Figure 20 
(§58) shows the di <ctions of the vectors ôx, 68, ôy of the infinitesimal rotations in terms of 
Eulerian angles. Taking components along the fixed axes x, y, z, we obtain for the angles of 
rotation about these axes 


é dz 
ddy 
6¢2 = 5a + cos f dy. 


—sin x 68 + cos g sin B dy, 


cos g 68 + sin asin B dy, 


Hence, conversely, 
ôg 


—cot 8 cos « $r —cot £ sin g êy + dz, 
ôß = —sin g dz + cos a $y, 


5 = OOo, 4 BOS 
á sin B sin B ty 








From these expressions we find 


c : ô cosa ¢ 
Ji = -if -cos «cot 8 — — sin a — + —~—}, 
Ca CB 








: ; C ĉ sina ô 
Ty = ~i( -sin a cot B = -4 PRR + : ee y 
C ` 


J= E. 


When the operators fz = —2@/@a and J, = —id/@y (y being the angle of rotation about the 
{-axis) act on the function Wye, they are replaced by M and k (the corresponding depen- 
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dence of the wave function on the angles x and y ts given bv the factor exp(1a/W + iyk)). Then 





j = Jiri], = e( GM co B+ £ ) 


e sin B 





eadein — 5 Mcoe BS | 


The rest of the derivation is exactly as at the end of §28. We start from the equation 
J sak = 0, which ts valid for the wave function with M = J. Hence 


o k 
— — f cot B + — . = Ô. 
(5 J cot B = 5) bon 
The normalized solution of this equation is 


ee as eee ee 
bye = (1) E em (cos $8)’ -* (sin 18) -* 


ei (Ja +ky) 
x ; 
2r 





the normalization integral is an Euler beta function. This expression is in fact the same, 


apart from a phase factor, as 
J 21+ ta, B, y): 


8r? 


cf. (58.26). The phase factor is chosen in accordance with the definition in (103.7). 
The wave functions with M <J are then calculated by repeated application to tisk of the 
formula 


j bsar = viJ-ADN+M+ 1)) boars. 


The final result is the same as (103.8), where the functions Di% are (58.10) and (58.11), and 
the symmetry properties (58.18) of these functions are to be taken into account. 


PROBLEM 2. Calculate the matrix elements <Jk’|H] Jk» for an asymmetrical top. 
SOLUTION. From formulae (27.13) we find 

CRP ky = <k Jelky = MJJ + 1)— A), 

CAP AIR+2> = k+ Jelky = -<k J PR+2> = —<R+21/,714> 

= $V i(J-AXJ-R-1XJ+k+1XJ+k+2)); 


for brevity, we everywhere omit the diagonal suffixes J, J of the matrix elements. Hence we 
have for the required matrix elements of the Hamiltoniant 


CRiARY = Iha +h] +1)—k2] + Liek, 
CRIH|R+2> = ¿k+ 2lHiky 
= Mla DRAJ =k - 1X J k+ IXJ +k + 2)). 


(1) 


The matrix elements with respect to the functions (103.12) are expressed in terms of the 
elements in formulae (1) by 


Ck+|H|k+> = CRIAIRD (RAI), <1+|H|l+> = C1H1) + <1H|- 1>, 
l (2) 
Ck +IHIk+2, +> = <k) H|k+2> (k #0), <0+|H|2+> = V2¢0|A}2). 





t In Problems 2-5, the formulae are simplified by using the notation 


@=1jI4, b = 1fIg, c = IIe. 
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PROBLEM 3. Determine the energy levels for an asymmetrical top with J = 1. 


SoLuTION. The secular equation, of the third degree, falls into three linear equations. 
One of these gives 


E; = (04+jHI0+) = bh4a+ b). (3) 


From this we can at once write down the other two energy levels, since it is obvious that the 
three parameters a, b, c enter the problem in a symmetrical manner. Hence 


E = h(a +c), Ez = kb +e). (4) 


The levels E,, Ex, E; belongt to the symmetry types B,, B2, Ba respectively. The wave 
functions of these states are ¥, = yib, fo = Yi, Y = ir 


PROBLEM 4. The same as Problem 2, but for J = 2 


SoLuTION. The secular equation, of the fifth degree, falls into three linear equations and 
one quadratic. One of the linear equations gives 


E; = (2-!H\2—) = 2h®c+ \h%{a +0), (5) 


a level of the type B,. Hence we at once conclude that there must be two other levels, of the 
types B, and B;: 


E> = 2hb + kate), Ez = 2ħ°a + hh +c). 
These three levels have the wave functions p, = Yo, Y: = Yn, Ys = $à- 


The equation of the second degree is 


eee (2+jH|0+) | = 4. (6) 
(2+|H|0+> (2+|H'2+4>-—E 
Solving this, we obtain 

E; 5 = hla +b +c) th?[(a + b+)? 3(ab + be +. ac)}}” (7) 


These levels belong to the type A. The corresponding wave functions are linear combinations 
of y and wah. 


PROBLEM 5. The same as Problem 2, but for J = 3. 


SoLuTION. The secular equation, of the seventh degree, falls into one linear equation and 
three quadratic. The linear equation gives 


Ey = (2-|H|2-) = 2h*%(a+b4+0), (8) 


a level of the type A. One of the quadratic equations is equation (6) of Problem 3, witha 
different value of J. Its roots are 


Peo SK (a+b) + he he[Aa—b)? +e? + ab — ac ~ be} (9) 


levels of the type B,. The remaining levels are obtained from these by permuting a, 6 and c. 


PROBLEM 6. Determine the splitting of the levels of a system having a quadrupole 
moment, in an arbitrary external electric field, 


SOLUTION. Taking as coordinate axes the principal axes of the tensor 2°¢/@xi@xx (see 
§76, Problem 3), we bring the quadrupole part of the Hamiltonian of the system to the form 


Ĥ = Af22+ B] +C], A+ B+C=0. 


+ This follows at once from considerations of symmetry. The energy £;, for instance, 2 
symmetrical with respect to the parameters a and b, and this property belongs to the energy o 
a state whose symmetry about the € and y axes is the same, 1.€. a state of the type B,. 
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Owing to the complete formal analogy between this expression and the Hamiltonian (103.1), 
the problem under consideration is equivalent to that of finding the energy levels of an 
asymmetrical top, the only difference being that here the sum of the coefficients 4 +B+C= 0, 
and the angular momentum can have half-integral values also. For these the calculations must 
be done afresh by the same method, but for integral J we can use the results of Problems 3 
to 5, obtaining the following values for the energy displacement AE for the first few values 
of J: 

AE = —4,-—B,—C; 
32: AE = + y[3(4°+ B+ CCN]; 
=2: AEF =34,3B,3C,~ y [6(4°+ B2+C?)). 
For J = 3/2 the energy levels remair doubly degenerate, in accordance with Kramers’ 
theorem (§60). 


§104. The interaction between the vibrations and the rotation of the 
molecule 


Hitherto we have regarded the rotation and the vibrations as independent 
motions of the molecule. In reality, however, the simultaneous presence of 
both motions results in a peculiar interaction between them (E. Teller, 
L. Tisza and G. Placzek 1932-33). 

Let us start by considering linear polyatomic molecules. A linear molecule 
can execute vibrations of two types (see the end of §100): longitudinal vibra- 
tions with simple frequencies and transverse ones with double frequencies. 
We shall here be interested in the latter. A molecule executing transverse 
vibrations has in general some angular momentum. This is evident from 
simple mechanical considerations,} but it can also be shown by a quantum- 
mechanical discussion. The latter also enables us to determine the possible 
values of this angular momentum in a given vibrational state. 

Let us suppose that some one double frequency w, has been excited in 
the molecule. The energy level with the vibrational quantum number v, 
is (v,+1)-fold degenerate. To this level there correspond the v,+1 wave 
functions 


3 Re 
bog va, = constant x e-¢a (Qa, +Qas MEHL a EO \GaOnn)s 


where vatu = Uy, Or any independent linear combinations of them. 
The total degree (in Q,, and Q,, together) of the polynomial by which the 
exponential factor is multiplied is the same in all these functions, and is equal 
to wy. It is evident that we can always take, as the fundamental functions, 
linear combinations of the functions Pu v,, of the form 


Poata = Constant x e~ex (Car +Qoa V2 [(O 1 HiO) HNO 1 iO 9) a2 n.. J. 
(104.1) 
The square brackets contain a determinate polynomial, of which we have 


t For example, two mutually perpendicular transverse vibrations with a phase difference 
of $7 can be regarded as a pure rotation of a bent molecule about a longitudinal axis. 
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written out only the highest term. /, is an integer, which can take the v,+1 
different values z,, v,—2, v,—4, e, —Ty- | 

The normal coordinates Q,1, Q.2 of the transverse vibration are two 
mutually perpendicular displacements off the axis of the molecule. Under a 
rotation through an angle ¢ about this axis, the highest term of the poly- 


nomial (and therefore the whole function ¥,,) is multiplied by 
ad 
giHdgtly)/2e-ibitg—le)/2 — pile 


Hence we see that the function (104.1) corresponds to a state with angular 
momentum /, about the axis. 

Thus we reach the result that, in a state where the double frequency wa 
is excited (with quantum number z,), the molecule has an angular momentum 
(about its axis) which takes the values 


l = Uy Uy 2, Umh, «-- » Vy. (104.2) 


a 

This is called the etbrational angular momentum of the molecule. If several 
transverse vibrations are excited simultaneously, the total vibrational angular 
momentum is equal to the sum Z/,. On being added to the electror. orbital 
angular momentum, it gives the total angular momentum ? of the molecule 
about its axis. 

The total angular momentum J of the molecule cannot be less than the 
angular momentum about the axis (just as in a diatomic molecule), i.e. J 
takes the values 


F= Wik, .<. 


In other words, there are no states with J = 0, 1,..., |//—1. 

For harmonic vibrations, the energy depends only on the numbers v,, and 
not on l. The degeneracy of the vibrational levels (with respect to the 
values of l) is removed by the presence of anharmonic vibrations. The 
removal is not complete, however: the levels remain doubly degenerate, the 
samc energy belonging to states differing by a simultaneous change of sign of 
all the 1, and of l. In the next approximation (after that of harmonic motion), 
a term quadratic in the angular momenta l, of the form 2 Eaglalg (the g, P 
being constants), appears in the energy. This remaining double degeneracy 
is removed by an effect similar to the A-doubling in diatomic molecules. 

When we turn to non-linear molecules, we must first of all make the 
following remark, which has a purely mechanical significance. For an arbi- 
trary (non-linear) system of particles, the question arises how we can at all 
separate the vibrational motion from the rotation: in other words, what we are 
to understand by a “‘non-rotating system”. At first sight it might be thought 
that the vanishing of the angular momentum, 


= mrxv = 0 (104.3) 


(the summation being over the particles in the system), could serve as a 
criterion of the absence of rotation. However, the expression on the left- 
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hand side is not the complete derivative, with respect to time, of any function 
of the coordinates. Hence the above equation cannot be integrated with 
respect to time in such a way as to be formulated as the vanishing of some 
function of the coordinates. This, however, is necessary if a reasonable 
definition of the concepts of “pure vibrations” and “pure rotation” is to be 

possible. 
As a definition of the absence of rotation, we must therefore use the 

condition 
2 mryXv = 0, (104.4) 


where rọ are the radius vectors of the equilibrium positions of the particles. 
Putting r = rọ+u, where u are the displacements in small vibrations, we 
have v = i= ù. The equation (104.4) can be integrated with respect to 
time, giving 

E mrxu = 0. (104.5) 


The motion of the molecule will be regarded as a combination of the purely 
vibrational motion, in which the condition (104.5) is satisfied, and the 
rotation of the molecule as a whole.t 

Writing the angular momentum in the form 


LmrXv = Lmryxv+ E muxy, 


we see that, in accordance with the definition (104.4) of the absence of 
rotation, the vibrational angular momentum must be understood as the sum 
2 mux v. However, it must be borne in mind that this angular momentum, 
being only a part of the total angular momentum of the system, is not con- 
served. Hence only a mean value of the vibrational angular momentum can 
be ascribed to each vibrational state. 

Molecules having no axis of symmetry of order above the second belong to 
the asymmetrical-top type. In a molecule of this type, all the frequencies 
are simple (their symmetry groups have only one-dimensional irreducible 
representations). Hence none of the vibrational levels is degenerate. In any 
non-degenerate state, however, the mean angular momentum vanishes (see 
§26). Thus, in a molecule of the asymmetrical-top type, the mean vibra- 
tional angular momentum vanishes in every state. 

If, among the symmetry elements of the molecule, there is one axis of order 
higher than the second, the molecule is of the symmetrical-top type. Such a 
molecule has vibrations with both simple and double frequencies. The mean 
vibrational angular momentum of the former again vanishes. To the double 
frequencies, however, there corresponds a non-zero mean angular momentum 
Component along the axis of the molecule. 

It is easy to find an expression for the energy of the rotational motion of the 
molecule (of the symmetrical-top type), taking into account the vibrational 
angular momentum. The operator of this energy differs from (103.5) in that 


+ The translational motion is supposed removed from the start, by 


: i 4 choosing a system of 
coordinates in which the centre of mass of the molecule is at rest. 
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the rotational angular momentum of the top is replaced by the difference 
between the total (conserved) angular momentum J of the molecule and its 
vibrational angular momentum J): 
h 
2l a 


L 


Poe 
ia Lo aks 





J— Foes gr( ) J.-J i. (104.6) 


The required energy is the mean value Hrot- The terms in (104.6) which 
contain the squared components of J give a purely rotational energy which is 
the same as (103.6). The terms which contain the squared components of 
J™ give constants independent of the rotational quantum numbers and may 
be omitted; the terms which contain products of components of J and J 
constitute the interaction here considered between the vibrations of the 
molecule and its rotation. This is called the Cortolts interaction (since it 
corresponds to the Coriolis forces in classical mechanics). In averaging these 
terms it must be borne in mind that the mean values of the transverse (&,7) 
components of the vibrational angular momentum are zero. The mean energy 
of the Coriolis interaction is therefore 


Ego = = Ř?kkyl Ic, (104.7) 


where the integer K is, as in §103, the component of the total angular momen- 
tum along the axis of the molecule, and ky = J, is the mean value of 
the component of the vibrational angular momentum for the vibrational 
state concerned; k, unlike k, is not an integer. 

Finally, let us consider molecules of the spherical-top type. These include 
molecules whose symmetry is that of any of the cubic groups. Such molecules 
have simple, double and triple frequencies (there being one-, two- and three- 
dimensional irreducible representations of the cubic groups). The degeneracy 
of the vibrational levels is, as usual, partly removed by the presence of 
anharmonic motion; when these effects have been taken into account there 
remain, apart from the non-degenerate levels, only doubly and triply degene- 
rate levels. Here we shall discuss these levels that are split by the presence of 
anharmonic motion. 

It is easy to see that, for molecules of the spherical-top type, the mean 
vibrational angular momentum is zero not only in the non-degenerate 
vibrational states but also in the doubly degenerate ones. This follows from 
simple considerations based on symmetry properties. The mean angular 
momentum vectors in two states belonging to the same degenerate energy 
level must be transformed into each other in all possible symmetry trans- 
formations of the molecule. None of the cubic symmetry groups, however, 
allows the existence of two directions transformed only into each other; 
only sets of three or more directions are so transformed. 

From these arguments it follows that, in states corresponding to triply 
degenerate vibrational levels, the mean vibrational angular momentum 1s 
non-zero. After averaging over the vibrational state, this angular momentum 
is represented by an operator whose matrix elements correspond to transitions 
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between three mutually degenerate states. In accordance with the number of 
these states, this operator must have the form Ci, where Î is the operator of 
an angular momentum of unity (for which 2/+1 = 3) and ¢ is a constant 
characterizing the vibrational level in question. The Hamiltonian of the 
rotational motion of the molecule is 


Bro = (h7/21\(J— J)? 


and, after averaging, becomes the operator 
| ope 
Azote = —J2?+—Jor?-—22J..1. 104.8 
rot = P+ je (104.8) 


The eigenvalues of the first term give the ordinary rotational energy (103.4); 
the second term gives an unimportant constant, which does not depend on the 
rotational quantum number. The last term in (104.8) gives the desired energy 
of the Coriolis splitting of the vibrational level. The eigenvalues of the 
quantity J.1 are calculated in the usual way; it has (for a given J) three 
different values corresponding to the values J+1, J—1, J of the vector 
J+l. The result is 


EcorV tP = = hecJ/T, 
EcorY—-) = het(J+1)/T, (104.9) 
Eco” = WII. 


§105. The classification of molecular terms 


The wave function of a molecule is the product of the electron wave func- 
tion, the wave function of the vibrational motion of the nuclei, and the rota- 
tional wave function. We have already discussed the classification and types 
of symmetry of these functions separately. It now remains to examine 
the question of the classification of molecular terms as a whole, i.e. of the 
possible symmetry of the total wave function. 

It is clear that, if the symmetry of all three factors with respect to some 
transformation is given, the symmetry of the product with respect to that 
transformation is determined. For a complete description of the symmetry of 
the state, we must also specify the behaviour of the total wave function when 
the coordinates of all the particles in the molecule (electrons and nuclei) 
are inverted simultaneously. The state is said to be negative or positive, 
according as the wave function does or does not change sign under this 
transformation. t 

It must be remembered, however, that the characterization of the state 
with respect to inversion is significant only for molecules which do not 


t We use the same customary, thou 


(§86). 


gh unfortunate, terminology as for diatomic molecules 
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possess stereoisomers. If stereoisomerism is present, the molecule assumes 
on inversion a configuration which can by no rotation in space be made to 
coincide with the original configuration; these are the “right-hand” and 
“left-hand” modifications of the substance.t Hence, when stereoisomerism 
is present, the wave functions obtained from each other on inversion belong 
essentially to different molecules, and it is meaningless to compare them. ] 

We have seen in §86 that, for diatomic molecules, the spin of the nuclei 
exerts an important indirect effect on the arrangement of the molecular 
terms by determining their degree of degeneracy, and in some cases entirely 
forbidding levels of a certain symmetry. The same is true for polyatomic 
molecules. Here, however, the investigation of the problem is considerably 
more complex, and requires the application of the methods of group theory 
to each particular case. 

The idea of the method is as follows. The total wave function must con- 
tain, besides the coordinate part (the only part we have considered so far), a 
spin factor, which is a function of the projections of the spins of all the nuclei 
on some chosen direction in space. The projection o of the spin of a nucleus 
takes 27+1 values (where 1 is the spin of the nucleus); by giving to all the 
Oy, Op) ««» , Oy (where N is the number of atoms in the molecule) all possible 
values, we obtain altogether (22,+1) (2i,+1) ... (2i,) +1) different values of 
the spin factor. In each symmetry transformation, certain nuclei (of the same 
kind) change places, and if we imagine the spin values to “remain fixed”, 
the transformation is equivalent to an interchange of spin values among 
the nuclei. Accordingly, the various spin factors will be transformed into 
linear combinations of one another, thus giving some representation (in 
general reducible) of the symmetry group of the molecule. Decomposing 
this into irreducible parts, we find the possible types of symmetry for the 
spin wave function. 

A general formula can easily be written down for the characters x,,(G) 
of the representation given by the spin factors. To do this, it is sufficient 
to notice that, in a transformation, only those spin factors are unchanged in 
which the nuclei changing places have the same oa; otherwise, one spin 
factor changes into another and contributes nothing to the character. Bearing 
in mind that ca takes 214+ 1 values, we find that 


Xe(G) = N (2i,+1), (105.1) 


where the product is taken over the groups of atoms which change places 
under the transformation G considered (there being one factor in the product 
from each group). 

We are, however, interested not so much in the symmetry of the spin 
function as in that of the coordinate wave function (by which we mean its 


+ For stereoisomerism to be possible, the molecule must have no symmetry element per- 
taining to reflection (i.e. no centre of symmetry, plane of symmetry, or rotary-refiection axis). 

t Strictly speaking, quantum mechanics always gives a non-zero probability for the tran- 
sition from one modification to the other. This probability, however, which relates to the 
passage of nuclei through a barrier, is extremely small. 
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symmetry with respect to interchanges of the coordinates of the nuclei, the 
coordinates of the electrons remaining unchanged). These two symmetries 
are directly related, however, since the total wave function must remain 
unchanged or change sign when any pair of nuclei are interchanged, accord- 
ing as they obey Bose statistics or Fermi statistics (in other words, it must be 
multiplied by (—1)**, where z is the spin of the nuclei that are interchanged). 
Introducing the appropriate factor in the characters ( 105.1), we obtain the 
system of characters x(G) for the representation containing all the irreducible 
representations by which the coordinate wave functions are transformed: 


x(G) as I(2i4 + 1)(— Lae), (105.2) 


where na is the number of nuclei in each group which change places under the 
transformation in question. Decomposing this representation into irreducible 
parts, we obtain the possible types of symmetry of the coordinate wave 
functions of the molecule, together with the degrees of degeneracy of the 
corresponding energy levels (here and later we mean the degeneracy with 
respect to the different spin states of the system of nuclei). 

Each type of symmetry of the states is related to definite values of the 
total spins of the groups of equivalent nuclei in the molecule (i.e. groups of 
nuclei which change places under the various symmetry transformations of 
the molecule). This relation is not one-to-one; each type of symmetry of 
states can, in general, be brought about with various values of the spins of 
equivalent groups. The relation can also be established, in any particular 
case, by means of group theory. 

As an example, let us consider a molecule of the asymmetrical-top type, the 
ethylene molecule C!*,H}, (Fig. 43g), with the symmetry group D,,. The 
index to the chemical symbol indicates the isotope to which the nucleus 
belongs; this indication is necessary, since the nuclei of different isotopes 
may have different spins. In this case, the spin of the H! nucleus is }, while 
the C?? nucleus has no spin. Hence we need consider only the hydrogen 
atoms. 

We take the system of coordinates shown in Fig. 43g; the z-axis is per- 
pendicular to the plane of the molecule, while the x-axis is along the axis 
of the molecule. A reflection in the xy-plane leaves all the atoms fixed, 
while other reflections and rotations interchange the hydrogen atoms in 


pairs. From formula (105.2) we have the following characters of the represen- 
tation: 


E o(xy) o(xz) o(yz) I C(x) Ca(y) Ca(2) 
16 16 4 4 4 4 4 4 


Decomposing this representation into irreducible parts, we find that it con- 
tains the following irreducible representations of the group D,,: TAr 3B,,, 
3Bo,, 3B3,- The figures show the number of times each irreducible represen- 


t The degree of degeneracy of the level in this respect is often called its nuclear statistical 
weight; see the last footnote to §86, 
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tation appears in the reducible one; these numbers are also the nuclear 
statistical weights of the levels with the corresponding symmetry.t 

The classification of the states of the ethylene molecule thus obtained 
relates to the symmetry of the total (coordinate) wave function, including 
the electron, vibrational and rotational parts. Usually, however, it is of 
interest to arrive at these results from a different point of view. Knowing 
the possible symmetries of the total wave function, we can find at once which 
rotational levels are possible (and with what statistical weights) for any 
prescribed electron and vibrational state. 

Let us consider, for instance, the rotational structure of the lowest vibra- 
tional level (that for which the vibrations are not excited at all) of the normal 
electron term, assuming that the electron wave function of the normal 
state is completely symmetrical (as is the case for practically all polyatomic 
molecules). Then the symmetry of the total wave function with respect to 
rotations about the axes of symmetry is the same as the symmetry of the 
rotational wave function. Comparing this with the results obtained above, we 
therefore conclude that in the ethylene molecule the rotational levels of the 
types A and B, (see §103) are positive with statistical weights 7 and 3, 
while those of the types Bz and Bg are negative with statistice. weight 3. 

As with diatomic molecules (see the end of §86), owing to the extreme 
weakness of the interaction between the nuclear spins and the electrons, 
transitions between states of different nuclear symmetry in the ethylene 
molecule do not occur in practice. Hence molecules in such states behave 
like different modifications of the substance. Thus ethylene Cl*,H1, has 
four modifications, with nuclear statistical weights 7, 3, 3, 3. 

In reaching this conclusion 't is important that states with different 
symmetry belong to different energy levels (the intervals between which are 
large compared with the interaction energy of nuclear spins). The conclusion 
is therefore invalid for molecules in which there exist states of different 
nuclear symmetry belonging to the same degenerate energy level. 

Let us consider another example, the ammonia molecule N14H},, of the 
symmetrical-top type (Fig. 41), whose symmetry group is C,,. The spin of 
the nucleus N? is 1, and that of H? is 4. Using formula (105.2), we find the 
characters of the representation of the group C}, in which we are interested: 


E rs Ol 36y 
24 6 12 


It contains the following irreducible representations of the group C3,: 12A,, 
6E. Thus two types of level are possib':; their nuclear statistical weights 
are] 12 and 6. l 

The rotational levels of a symmetrical top are classified (for a given J) 
according to the values of the quantum number k. Let us consider, as in the 


t The relation between the symmetry of states and the values of the total spin of the four 
hydrogen nuclei in the ethylene molecule is derived ın Problem 1. 

t A total spin of the hydrogen nuclei of 3/2 corresponds to the terms of symmetry A,, and 
one of 1/2 to those of symmetry E. 
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previous example, the rotational structure of the normal electron and 
vibrational state of the NH, molecule (i.e. we suppose the electron and vibra- 
tional wave functions to be completely symmetrical). In determining the 
symmetry of the rotational wave function, we must bear in mind that it is 
meaningful to speak of its behaviour only with respect to rotations about axes. 
Hence we replace the planes of symmetry by axes of symmetry of the second 
order perpendicular to them, a reflection in a plane being equivalent to a 
rotation about such an axis, followed by an inversion. In the present case, 
therefore, instead of the group C,, we have to consider the isomorphous point 
group D.. 

The rotational wave functions with k = +|k| are multiplied by et2zilkys 
respectively under a rotation C, about a vertical axis of the third order, 
while under a rotation U, about a horizontal axis of the second order they 
change into each other, thus giving a two-dimensional representation of the 
group D,. If |k| is not a multiple of three, this representation is irreduc- 
ible; it is Æ. The representation of the group C,, corresponding to the total 
wave function is obtained by multiplying the character x(U,) by 1 or —1, 
according as the term is positive or negative. Since, however, in the repre- 
sentation E we have x(U,) = 0, we obtain the same representation E in 
either case (but this time as a representation of the group C,,, and not D,). 
Bearing in mind the results obtained above, we thus conclude that, when [2] 
is not a multiple of three, both positive and negative levels are possible, with 
nuclear statistical weights of 6 (the symmetry of the total coordinate wave 
function being of the type E). 

When |A| is a multiple of three (but not zero), the rotational functions 
give a representation (of the group D,) with characters 


2 2 oo 





This representation is reducible, and divides into the representations A,, Ao. 
In order that the total wave function should belong to the representation A, of 
the group C,,, the rotational level A, must be negative and A, positive. 
Thus, when |A| is a multiple of three and not zero, both positive and negative 
levels are possible, with nuclear statistical weights of 12 (levels of the type 
Ao). 

Finally, only one rotational function corresponds to an angular momentum 
component k = 0; it gives a representation with characterst 


E Soest’. 
1 be ety 





If the total wave function has the symmetry A,, its behaviour with respect to 
inversion must therefore be given by the factor (—1)’*!. Thus, for k = 0, 


t On rotation’ through an angle v, the eigenfunction of the angular momentum with 
magnitude J and component zero is multiplied by (—~1)Y. 
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levels with even and odd J can only be negative and positive respectively; 
the statistical weight is 6 in either case (levels of the type A,). 

Summarizing these results, we have the following table of possible states 
for various values of the quantum number k for the normal electron and 
vibrational term of the molecule N44H}43 (the symbols + and — denoting 
positive and negative states). 


-+ — 
|k | not a multiple of 3 6E 6E 
| k | a multiple of 3 1242 1242 
k=0 J even — 6A2 

a J odd 6A2 — 


For given J and k, the energy levels of the NH3 molecule are in general 
degenerate (see also the table for ND3 in Problem 3). This degeneracy is 
partly removed by a peculiar effect due to the flat shape of the ammonia 
molecule and the small mass of the hydrogen atoms. By a fairly small vertical 
displacement of the atoms in this molecule a transition can be brought about 
between two configurations obtained from one another by a reflection in a 
plane parallel to the base of the pyramid (Fig. 44). These transitions cause a 
splitting of the levels, separating positive and negative levels (an effect 
similar to the one-dimensional case considered in §50, Problem 3). The 
magnitude of the splitting is proportional to the probability of passage of the 
atoms through the “potential barrier” separating the two configurations of 
the molecule. Although this probability is comparatively high in the ammonia 
molecule, owing to the above-mentioned properties, the splitting is still 
small (1 x 10-4 eV). 

An example cf a molecule of the spherical-top type is discussed in 
Problem 5. 


N 
Fic. 44 


PROBLEMS 
PROBLEM 1. Find the relation between the symmetry of the state of the C??2H14 molecule 
and the total spin of the hydrogen nuclei in the molecule. 
SOLUTION.t The total spin of the four H! nuclei can take the values J = 2,1, 0, and its 
component M; takes values from 2 to —2. Let us consider the representations given by the 
spin factors for each value of Mı, beginning with the largest. 


+ A method of solving problems of this kind, based on the theory of permutation groups, 
in miran in Kanlan’s book (quoted in §63), ch. VI, §2. 
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The value Mr = 2 corresponds to only one spin factor, in which all the nuclei have a spin 
component + $. The value Mr = 1 corresponds to four different spin factors differing as 
regards the nucleus which has spin component —. Finally, the value Mr = 0 is given 
by six spin factors, depending on the pair of nuclei which have spin components —$. The 
characters of the three corresponding representations are as follows: 


E (xy)  o(xz) of yz) I Coz)  Ca(y) C(x) 
1 1 1 1 


Mr = 2 1 1 1 1 
Mr = 1 i + 0 0 0 0 0 0 
Mr=0 6 6 2 2 2 2 2 2 


The first of these representations is the unit representation Ag; since the value Mp = 2 
can occur only for J = 2, we conclude that a state with symmetry Ag corresponds to spin 
i= 2, 

The value M: = 1 can occur for both J = 1 and J = 2. Subtracting the first representa- 
tion from the second and decomposing the result ir.to irreducible parts, we find that states 
Big, Bou, Bay correspond to spin J = 1. 

Finally, the value Mr = O can occur in all cases where Mr = 1 is possible, and also for 

= 0. Subtracting the second representation from the third, we find two states Ag corres- 
ponding to spin J = 0. 

PROBLEM 2. Determine the types of symmetry of the total (coordinate) wave functions, 
and the statistical weights of the corresponding levels, for the molecules Ce ee A 
N1405, (ali these molecules are of the same form; the spins are i(H?) = 1, (C5) = 4, 
(N11) = 1). 

SOLUTION. By the method shown in the text for the molecule C",H',, we find the follow- 
ing states (the axes of coordinates being taken the same as above): 


Molecule + zi 

LA A 27Ag, 18Big i8Beu, 18B3, 
C13,H1, 16A,g,12B i, 12Beu, 24Bzu 
N14,016, 6A, 3B3u 


PROBLEM 3. The same as Problem 2, but for the molecule N!*H?,. 

SOLUTION. In the way shown in the text for the molecule N"*H?,, we find the states 3044, 
3A,, 24E. 

In the normal electron and vibrational term, the following states are possible for various 
values of the quantum number k: 


+ me 
Jk] not a multiple of 3 24E 24E 
[A] a multiple of 3 304,,34,  30A,, 34, 
jee 0 | J even 304, 34, 
J odd 34, 304, 


PRoBLEM 4. The same as Problem 2, but for the molecule C!1,H', (see Fig. 43f; the 
symmetry is Dy a). 


SOLUTION. The possible states are of the types 74g, 1Au, 34g, 134, 9E;, 11Eu. 
In the normal electron and vibrational term, the following states are obtained: 


abe — 
|| not a multiple of 3 9E; 11Eu 
Jk] a multiple of 3 TAg, 3A 1A, 13A 
nn | J even TA 14u 
J odd 3Aag 13 Aye 


_ ProsLem 5. The same as Problem 2, but for the methane molecule C'H1, (the C atom 
is at the centre of a tetrahedron with the H atoms at the vertices), 


SOLUTION. The molecule is of the spherical-top type, and has the symmetry Ty. Follow- 
ing the same method, we find that the possible states are of the types 542, 1E 3F 1 (cor- 
responding to a total spin of the molecule of 2, 0, 1 respectively). À 

The rotational states of a spherical top are classified according to the values of the total 
angular momentum J. The 2J-+1 rotational functions belonging to a particular value of J 
give a (2 )+1)-dimensional representation of the group O, which is isomorphous with the 
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group Ta; it is obtained from the latter by replacing ali planes of symmetry by axes of the 
second order perpendicular to them. The characters in this representation are given by 
formula (98.3).° Thus for example, for J = 3 we obtain a representation with characters 


E 8C3 6C, 6C, 3C,? 
7 l a l EA 





This contains the following irreducible representations of the group O: A, F,, Fẹ Again 
considering the rotational structure of the normal electron and vibrational term, we therefore 
conclude that, for J = 3, the states with a symmetry Ae of the total wave function can only 
be positive, while those of type F, can be either positive or negative. For the first few values 
of J we thus obtain the following states (which we write together with their statistical weights) : 


5A., 3F; 3F, 
1E, 3F, 5A,, 1E, 3F; 


ee 
tow wv uu 
A wh = © 
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CHAPTER XIV 


ADDITION OF ANGULAR MOMENTA 


§106. 3j-symbols 


THE rule of addition of angular momenta deduced in §31 gives the possible 
values of the total angular momentum of a system consisting of two particles 
(or more complex components) with angular momenta jı and je.f This rule 
is in fact closely related to the properties of wave functions with respect to 
spatial rotations, and follows immediately from the properties of spinors. 

The wave functions of particles with angular momenta jı and jg are sym- 
metrical spinors of ranks 27; and 2j2, and the wave function of the system is 
their product, 


A flee. po... 
(1) ——— y2) 
pa) yer (106.1) 


Symmetrizing this product with respect to all the indices, we obtain a sym- 
metrical spinor of rank 2(j1+j2), corresponding to a state with total angular 
momentum ji+j2. If we contract the product (106.1) with respect to one 
pair of indices, of which one must belong to Yy® and the other to Y (since 
otherwise the result is zero), the symmetry of each of the spinors 4@ and 
ws) shows that it does not matter which indices are taken from A, p, ... and 
p,o,.... After symmetrization we obtain a symmetrical spinor of rank 
2(j1+je—1), corresponding to a state with angular momentum ji+je—1.f 
Continuing this process, we find, in agreement with the rule already known, 
that 7 takes values from jı +j2 to |j1—je|, each occurring once. 
Mathematically, this involves the decomposition of the direct product 
D» x DG) of two irreducible representations of the rotation group (with 





t Strictly speaking, we shall always be considering (without explicitly mentioning the fact 
each time) a system whose parts interact so weakly that their angular momenta may be 
regarded as conserved in a first approximation. 

All the results given below apply, of course, not only to the addition of the total angular 
momenta of two particles (or systems) but also to the addition of the orbital angular momen- 
tum and spin of the same system, assuming that the spin-orbit coupling is sufficiently weak. 

t To avoid misunderstanding, the following comment is useful. The wave function of a 
system of two particles is always a spinor of rank 2(j:+J2), and this is in general not equal to 
2p where j is the total angular momentum of the system. Such a spinor may, however, be 
equivalent to a spinor of lower rank. For example, the wave function of a system of exe 
particles with angular momenta Jı = je = 41s a spinor of rank two; but if che total angular 
momentum j = 0, this spinor is antisymmetrical, and therefore reduces to a scalar In 
general, the total angular momentum j determines the symmetry of the spinor wave furiction 


of the system: this ts symmetrical with respect to 2) indices and antisymmetrical with respect 
to the remainder 


ARSS 
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dimensions 2/1 + 1 and 2j2+ 1) into irreducible parts. The addition rule for 
angular momenta may then be written as 


DW x DG) = Dih+id 4 DM+e-D 4 4 DUAR-F), 


For a complete solution of the problem of the addition of angular momenta, 
we must also consider the problem of constructing the wave function of a 
system with a given total angular momentum from those of its two component 
particles. 

Let us begin with the simple case of the addition of two angular momenta 
to give a zero total angular momentum. Here we must evidently have 
ji = jg and angular momentum components mı = —ñt2. Let pjm be the 
normalized wave functions of the states of one particle with angular momen- 
tum j and component thereof m (in the non-spinor representation). The 
required wave function Wo of the system is the sum of the products of the wave 
functions of the two particles with opposite values of m: 


to = se 3 (IAY D mp m (106.2) 
V+) £, 
where j is the common value of jı and jo. The factor preceding the sum is 
due to the normalization. The coefficients in the sum must all have the 
same absolute value, since all values of the components m of the angular 
momenta of the particles are equally probable. The sequence of signs in 
(106.2) is easily found by means of the spinor representation of the wave 
functions. In spinor notation the sum in (106.2) is a scalar (the total angular 
momentum of th. system being zero) 


PDAperngl2, (106.3) 


formed from two spinors of rank 2j. Using this, we find the signs in (106.2) 
directly from (57.3). 

It should be borne in mind, however, that in general only the relative 
signs of the terms in the sum (106.2) are determinate, while the sign of the 
whole sum may depend on the “‘order of addition” of the angular momenta. 
For, if we lower all spinor indices (j-+m ones and j—m twos) in ¥ and 
raise them in 72), the scalar (106.3) is multiplied by (—1)*/, and therefore 
changes sign when ; is half-integral. 

Next we consider a system with zero total angular momentum consisting of 
three particles with angular momenta ji, J2, J3 and components thereof my, 
mo, m3. The condition for the total angular momentum to be zero is that 
m,+m2+m3 = 0 and jj, je, J3 have values such that each of them can be 
obtained by vector addition of the other two, i.e. geometrically j1, J2, J3 Must 
be the sides of a closed triangle. In other words, each of them lies between 
the difference and the sum of the other two: 


lji —j2| < ys < ji +2, ete. 


It is evident that the algebraic sum 71+ j2+ 3 is an integer. 
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The wave function of the system under consideration is the sum 


pa S . J2 J3 J mb mab ma (106.4) 
Mi Mo Mg 
MiM, M3 
taken over the values of each m; from —j; to j;. The coefficients in this formula 
are termed Wigner 3j-symbols. By definition they are non-zero only if 
mı + m+ m3 = 0. 

When the sufhxes 1, 2, 3 are permuted, the wave function (106.4) can 
change only by an unimportant phase factor. The 3j-symbols can in fact be 
defined as purely real quantities (see below), and then the indeterminacy of 
Yo Can consist only in its sign as a whole being indefinite (as is true of the 
function (106.2) also). This means that interchanging the columns of a 
3j7-symbol can either leave it unchanged or change its sign. 

The most symmetrical way of defining the coefficients in the sum (106.4), 
which is the definition generally used for the 3j-symbols, is as follows. In 
spinor notation, yo 1s a scalar formed by contracting the product of the three 
spinors PQ)Ae-, Yau... pAr- with respect to all pairs of indices belonging 
to two different spinors. In each pair belonging to particles 1 and 2 the spinor 
index will be written superior with Y® and inferior with Y@); in a pair belong- 
ing to particles 2 and 3, superior with %@) and inferior with y); and in a pair 
belonging to particles 3 and 1, superior with ¢@ and inferior with y®. It 
is easily seen that the total number of pairs of each kind is ji +j2—js, J2+]3— f1, 
Jı +j3—j2 respectively. This rule determines uniquely the sign of wp. 

It is evident that, with this definition, cyclic interchange of the indices 
1, 2 and 3 leaves po unchanged. This means that the 3j-symbol is unchanged 
when its columns are cyclically permuted. Interchange of any two indices 
is easily seen to require the raising of the lower indices and lowering of the 
upper indices in all jį +j2+j3 pairs. This means that go is multiplied by 
(—1)/+3:+33; in other words, the 3j-symbols have the property 


a z (= 1ye ( 7 ha "= etc., (106.5) 


m2 Mı mg my me Mg 


i.e. they change sign when two columns are interchanged if jı +j2+j3 is odd. 
Finally, we easily see that 


jij i) = (- tytn ( Je +: (106.6) 


—m, -m2 — M3 mi Mo m3 


a change in the sign of the z-component of each angular momentum can 
be regarded as the result of a rotation through an angle m about the y-axis, 
and this is equivalent to raising all the lower spinor indices and lowering all 
the upper ones (see (58.5)). 

From (106.4) we can derive an important formula which gives the wave 
function Wjm of a system consisting of two particles and having given values 
ofj and m, To do so, we consider the particles 1 and 2 together as one system. 
Since the angular momentum j of this system together with the angular 
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momentum jg of particle 3 gives a total angular momentum of zero, we must 


have j = j3, m = — m3. According to (106.2) we can then write 
l 
po = Teja = (—1y pimp! 7 —m- (106.7) 


This formula is to be compared with (106.4) (in which we replace j3, mg by 
j, —m). Here, however, we must first take into account the fact that the 
rule for constructing the sum in (106.7) according to (106.3) does not corres- 
pond to the rule for constructing the sum (106.4): to bring (106.7) to the 
form (106.4) we must, as is easily seen, interchange pairs of upper and lower 
indices corresponding to particles 1 and 3. This leads to an additional factor 
(—1)#-#+%, The result ist 


hepy 


my, mo —m 


m = (=n) > ( 


Mi Me 


) Pi my am (106.8) 


where the summation over m and mg is subject to the condition mı +m = m. 

Formula (106.8) gives the required expression for obtaining the wave 
function of a system from those of its two particles, which have definite 
angular momenta f; and jọ. It can be written in the form 


him = > Cmymoljm> pjm p Pm, (m2 = m—m)). (106.9) 


Mi, Me 


The coefficients 
ate | (106.10) 


my mo —m 


<myme] jm) = (=y y/(2j+1)( 


form the matrix of the transformation from the complete orthonormal set of 
(2); +1)(2j2+1) wave functions of states |m; m2) to the similar set of wave 
functions of states jjm> (for given values of ji, j2). They are called vector 
addition coefficients or Clebsch-Gordan coefficients. The notation <mımo| jm > 
corresponds to the general notation for the coefficients in the expansion of 
one set of functions in terms of another (11.18). To simplify, we have 
omitted the quantum numbers jı and j2, which are the same in both sets of 
functions. When necessary, these are included, in the form ¢jimijame| 


pyejm).T 
+ Under time reversal, the wave functions change in accordance with (60.2): 
Wim —>( = 1)7-™h;, —m- 
It is easily verified that the function Yim on the left of (106.8) is transformed in this way if the 


functions ¢j,m, and yj.m, on the right are. l : 
t The Clebsch-Gordan coefficients are also denoted in the literature by 


jm Im 
C ia, or Scaler” 
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The matrix of the transformation (106.9) is unitary (see §12). The co- 
efficients of the inverse transformation 


J, +j, 
YD 5m PO jm, = > Cj, my + mg|mym2>h 5m, +m, (106.11) 
j=tj -jl 


are therefore the complex conjugates of those in the transformation (106.9). 
We shall see later that these coefficients are real, so that we have simply 


<mım| jm = <jmimyme)>. 


According to the general rules of quantum mechanics, the squares of the 
coefficients in the expansion (106.11) give the probability for the system to 
have any particular values of j and m (for given j1, mı and j2, m2). 

The unitarity of the transformation (106.9) means that its coefficients 
satisfy certain orthogonality conditions. According to formulae (12.5) and 


(12.6) 


D <miımo|jmy<miımojj mY 


MLM 


= (%+1) z 3 J2 A ie j2 gi 
myi m —m/ \m, Mm —m' 


MMag 


= Ôj Ômm’, (106.12) 


> <mymz| jm) <my'm3'|jm)> 


j 
= > a+n(" je oe j2 a 
; “Xm, m —m/ \m' me’ —m 


Sia (106.13) 


mim’ Meme’ 


The explicit general form of the 3j-symbols is quite lengthy. It can be 
written ast 


(” j2 e 7 [es 
mı mz m3 (ji+je+js+ 1)! 


t The coefficients in (106.9) were first calculated by E. P. Wigner (1931). ‘Their symmetry 
properties and the symmetrical expression (106.14) were first derived by G. Racah (1942). 
The most direct method of calculation is probably to go immediately from the spinor repre- 
sentation of %ọ (appropriately normalized) to the representation in the form of the sum (106.4) 
by means of the correspondence formula (37.6); it mav be noted that, since the coefficient in 
this formula is real, so also must be the 3j-symbols. Another derivation is given by A. R. 
Edmonds, Angular Momentum in Quantum Mechanics, Princeton, 1937. The table of 3j- 
symbols given below is also taken from Edmonds’s book. 
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x [(J1 + m)! (jı — mı)!(j2+m2)!(J2— me)!(j3-+ m3)\(j3— ms)! J1? x ` 


(— lyti jeema 
«2 ae ea E 


=~ 2l(fit+je—j3—2)!(j1 — m — 2)!'(go + me— 2)!(Jg—jotmi+2z)'(a—-f— m2+ 2z)! 
(106.14) 


The summation is over all integers z but, since the factorial of a negative 
number is infinite, the sum contains only a finite number of terms. The 
coefficient of the sum is obviously symmetrical in the suffixes 1, 2, 3; the 
symmetry of the sum itself appears if the values of the summation variable 
z are interchanged. 

Besides the symmetry properties (106.5) and (106.6), which follow imme- 
diately from the definition of the 3j-symbols, the latter also have other 
symmetry properties, though the derivation of these is more complex and 
will not be given here. The properties in question can be conveniently 
formulated in terms of a three-by-three array of numbers derived from the 
parameters of the 3j-symbol as follows: 


oh E Jotja—fi jstji—je jitje—Js 
Ji J2 )3 2 l 
( )- Jı mı j2—me J3—™3 | (106.15) 


mı M2 mg ' l f 
j+ mı j+ m2 Jat mg 


the sum of the numbers in each row and each column of this array is Jı + jz +13. 
Then (1) interchange of any two columns cf the array multiplies the 3)- 
symbol by (—1)/:+7:+7s (the same property as that given by (106.5)); (2) the 
same is true for interchange of any two rows (for the two lower r>ws, the same 
property as that given by (106.6)); (3) the 3j-symbol is unchanged when the 
rows and columns of the array are interchanged.t 

Some of the simpler formulae for particular cases will be given here. The 
value 


J 109 l 
( ) = (—1)i-m_—__ — (106.16) 
m —m 0 4/(27 +1) 


corresponds to formula (106.2). The formulae 
( n je ntf 
mı mo —mMmy—Me 


) = (— ])/:-datmitme x 


(2j1)!(2je)!(jr +j2 + mı + me)!(ji-+je— mı — me)! 


1/2 
Aa a a 
(2ji + 2jo-i-1)'(ja +m)!(j1 — m)!(j2 + m2)!(j2— S 


C je J3 
ji —j1— m3 mg 


) = (— 1)-jtjt+ma x 


+ See T. Regge, Il nuovo cimento [10] 10, 544, 1958; 11, 116, 1959. The more profound 
mathematical features of the symmetry property (106.15) (and of the property (108.3) of the 
6j-symbols) are discussed in the review article by Ya. A. Smorodinskii and L. A. Shelepin, 
Soviet Physics Uspekhi 15, 1, 1972. 
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(2j1)\(—ji +J2 +j3)!(j1 +j2 + ms)!(j3— ma)! i 
(Ja +j2+j3 + 1)Mj1—Jo+ja)\(j1 +32—Js)\(—ji-+ye—ms)!(js-+ ma)! 
are obtained directly from (106.14). The derivation of the formula 
( je i) =| ef +j- j)! Gi-j+})( -j1 +j a 
000 (2p+1)! 
p! 
A a aa 
(p—j1)!(b —72)!(p—ja)! 
where 2p = j1+j2+/3 is even, requires a number of additional calculations ;+ 
when 2p is odd, this 3j-symbol is zero owing to the symmetry property 
(106.6). 
Table 9 gives for reference the values of the 3j-symbols for j3 = 4, 1, 3, 2. 
For each jg the minimum number of 3j-symbols are shown from which 
the remainder may be obtained by means of the relations (106.5), (106.6). 


(106.18) 


TABLE 9 
Formulae for 3j-symbols 


es ae 1,2 


= 7 
a s 


m —m~-t4 


l 
Cie a) 
e —M— m3 Mg 


ma 0 


) = pm 


2m 
[CI+ 1)(27 +2)]'? 
[em 1)\(g7—m4+1)q}- 
(27 + 1)(27 + 2)(2j + 3) 


Nis l 
Ji 


Daria] 2 
! 2)(27 + 1)(27 +2) 
7¥*1 Gj- _Gamgamtly m+) le 
(27 + (+ (2+ 22+ 3) 3) 
(—1)i-m-1 (" aa ) 
m —m— m3 m3 
os E 
-m+ 12 
itt -urme i] 
2)(2j + 1)(2j + 2X2; +3) 

+3 freee r 12 

l (2) + INQ + 2)(2j + 3)(27 +4) 


t See Edmonds’s book already quoted. 
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itt fe ea Js 
29(2j + 1)(27 + 2)(2j + 3) 
G-m-}- m+} -m+ 3y] 
j+3 VS al] 
(27 + 1)(27 +2)(27 + 3)(27+4) 


i r ? 
CT 7) 
n —m— n3 m3 


1 


2[3m?—j(7 + 1)] 


J as a ee 
[27 — 12727 + IX + 227 + 3)] = 
6(j+mtl)G—omtl1) We 
JEn 3 eG 
29(2j + 127 + 2)(27 + 327+ 4) 
a2 ee 1/2 
(27 + 1)(27 + 2)(27 + 3)(2j + 4)(27 + 5) 


1 


6(j+ mt 1)(y—m) 1/2 
j ne of = | 
(2j— 1)2j(2j + 1)(2 + 2)(2j +3) 
La -jrm —— a] 
2j(2j+1)(2j + 2X2 + 3X(2j +4) 


ap m+2)(j—=m+2)(j—m+1)j—-m)7"* 
(2) + 1X27 + 2)(2j + 3)(2j + 4)(2 + 5) 


1 


So 1)(j+m42)})? 


i G- DAGANG) 
G-m-1Xj-m(j-m+1)J+m+2)7!? 
s -DGG 
. (j—m—1)(j—m)(j—m+ 1I)G—m+2)7)" 


PROBLEM 


§106 


Determine the angle dependence of the wave functions of a particle with spin $ in states 
with given values of the orbital angular momentum l, the total angular momentum j and 


component thereof m. 


SoLUTION. The problem is solved by the general formula (106.8), in which 4%? must 
be taken as the eigenfunctions of the orbital angular momentum (i.e. the spherical harmonic 


functions Yim';), and ¥) as the spin wave function x(c) (where o = +4): 
HF J 
bm = (oyin > ( 


P= GG = 


) ¥i.m-ox(¢)- 
Substituting the values of the 3j-symbols, we obtain 


jèm ee 
hr 41/29m = a Vimaer | Foxman 
2j 2j 








—m+ l i ptm i a 
pi -1/2.m = - f Kamat | x(—$) Yi, m4172. 


2j+2 2j+2 
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§107. Matrix elements of tensors 


In §29 formulae have been obtained which give the matrix elements of a 
vector physical quantity in terms of the value of the angular momentum 
component. These formulae are really a particular case of the corresponding 
general formulae for an irreducible (see §57) tensor of any rank.} 

The set of 2k + 1 components of an irreducible tensor of rank k (an integer) 
are equivalent, as regards their transformation properties, to a set of 2k+1 
spherical harmonic functions Ypg, g = —h, ..., k (see the last footnote to 
§57). This means that, by means of appropriate linear combinations of the 
components of the tensor, we can obtain a set of quantities which are trans- 
formed under rotations as the functions Ypg. A set of such quantities, which 
will be denoted here by fig, is called a spherical tensor of rank k. 

For example, k = 1 for a vector, and the quantities fig are related to the 
components of the vector by the formulae 


t 
fio = ta, fiat = t pgti): (107.1) 
cf. (57.7). The corresponding formulae for a tensor of rank two are 
fzo = E f2 — PN), } (107.2) 
f2 ae (azz — Ayy +t 2iazy), 


with azz+ayy+4zz = 0.t 

The construction of tensor products from two (or more) spherical tensors 
fra» Ska, 18 effected in accordance with the rules for addition of angular 
momenta, with kı, ke formally representing the “‘angular momenta” corres- 
ponding to these tensors. Thus from two spherical tensors of ranks k and 
kz, one can form spherical tensors of ranks K = kı + ko, ..., [ki — ke| by means 
of the formulae 


(Jk, 2k) KQ = pa <qıq2| KOY fragrga: 


di ga 
K 


kı ko 
= (—] kı—kı+Q, / 2K +1 
(= eere P rit 


faatia (107.3) 


cf. (106.9). The scalar product of two spherical tensors of the same rank k is, 
however, usually defined as 


(Jfk8r)oo = 2 (— 1)¥-9fx 2k, ~a: (107.4) 


which differs from the definition according to (107.3) with K = Q = 0 by 


t The analysis of the problems discussed in §§107~-109, and most of th 
due to G. Racah (1942-1943), ost of the results given, are 


t It ts assumed that the quantities frq are complex only because of the use of spherical 
components, t.e. that the original Cartesian components of the tensor are real. 
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the factor 4/(2k+1); cf. (106.2).¢ This definition can also be written in the 
form 


(fiek)oo = D fraBra* 


if we note that the complex conjugate of a spherical tensor is given by 


frat = (— 1)" “fx, -o3 


cf. (28.9).T 

The representation of physical quantities in the form of spherical tensors 1s 
particularly convenient for the calculation of their matrix elements, since it 
allows the direct application of the results of the theory of addition of angular 
momenta. 

By the definition of the matrix elements we have 


fralnim = > <njm'| feginim>Yn-jans (107.5) 
n'im’ 

where the Ynjm are the wave functions of the stationary states of the system, 
described by its angular momentum j, component thereof m and the set of 
the remaining quantum numbers n. As regards transformation properties, 
the functions on the right-hand and left-hand sides of equation (107.5) 
correspond to the respective sides of equation (106.11). Hence we imme- 
diately obtain the selection rules: the matrix elements of the components fkg. 
of an irreducible tensor of rank À are zero except for transitions jm > jm’ 
which satisfy the “angular momentum addition rule’ j’ = j+ k; the numbers 
J’, J, k must satisfy the “‘triangle rule’, i.e. must be able to form the sides of a 
closed triangle, and m’ = m+q. In particular, the diagonal matrix elements 
can be different from zero only if 27 > k. 

Next, it follows from the same transformation correspondence that the 
coefficients in the sum (107.5) must be proportional to the coefficients in 
(106.11) (the Wigner—Eckart theorem). This determines the dependence of 
the coefficients on the numbers m and m’, and the matrix elements are there- 
fore written in the form 


ace F} z « . ? Y k ] rh I . 
enj franj) = P- Iyim J 5 aKa | feli7g>, (107.6) 


where j,,,, is the greater of j and j’, and the <n'j'|| fx||mj> are quantities 
independent of m, m’ and q, called the reduced matrix elements. This formula 
gives the solution to the problem of determining the dependence of the 
matrix elements on the angular momentum components. The dependence 








+ If A and B are two vectors corresponding to the spherical tensors f4 and gio according to 
formulae (107.1), then (fig;Joo = A. B. 

į Here we may repeat the comment made concerning formula (106.8): with this rule, 
taking the complex conjugate of the tensors of ranks k, and k, on the right of (107.3) leads to 
a similar result for the tensor of rank K on the left. 
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is entirely governed by the symmetry properties with respect to the rotation 
group, whereas the dependence on the other quantum numbers is deter- 
mined by the physical nature of the fxg themselves.+ 

The operators fq are related by 


frat = (-1 -Y;,~¢. (107.7) 


The equation 
Cag m| frqlnjmy* = (—1)¥-0<njm| fr,—q\n'j'm') (107.8) 


therefore holds for their matrix elements. Substituting (107.6) and using the 
properties (106.5) and (106.6) of the 3j-symbols, we obtain for the reduced 
matrix elements the “Hermitian” relationt 


Cni’ felng> = <ni feln jy”. (107.9) 


The matrix elements of the scalar (107.4) are diagonal inj and m. According 
to rule of matrix multiplication 


<n'jm|(fegx)oolnyjm> = = (—1)k-a È <njm fealn”j my x 


x (n ym" |g, —g|njm). 


Substituting here the expressions (107.6) and effecting the summation over q 
and m” by means of the orthogonality relation for the 3j-symbols, we obtain 


k . 1 s f re'r E l E l y 
<n'jml( fegeoolnjmy = = Dail frin j Xn j Igulini> 
jan Dp 
(107.10) 


Similarly, we easily obtain the following formulae for the sums of the 
squared matrix elements: 





y ? = 9 1 peti 5 ) 
> ia m'| figinjm>|? = ——|<n'j"' frig, (107.11) 
ae 2j+1 , 
> [<n] m| frginjm>| ? = ain frinj>|?. (107.12) 


m,m’ 


In the first of these the summation is over g and m’ for a given value of m, 


and in the second it is over m and m’ for a given value of g (in every case, 
d 
m = m+4q). 


+ From these results, in particular, there follow at once the selection ru 
the matrix elements of a vector and formulae (29.7)-(29.9) for these. 
t The phase factor in the definition (107.6) is in fact chosen so as to make this relation valid. 


les given in §29 for 
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For reference purposes we may consider the case where the quantities fxg 
are the spherical harmonic functions Yxq¢ themselves, and give their matrix 
elements for transitions between the states of one particle with integral orbital 
angular momenta l and l2, 1.e. the integrals 


<lym}| Yim|leme> = f Yim” Yim Yum, do. (107.13) 


Besides the selection rule corresponding to the rule of addition of angular 
momenta (1+12 = 1), there is also a rule for these matrix elements whereby 
the sum /+1,+/, must be even. This is due to the conservation of parity, 
according to which the product (— 1)L+!: of the parities of the two states must 
be the same as the parity (— 1)! of the physical quantity considered (see §30). 

The matrix elements (107.13) are a particular case of a more general 
integral to be calculated in §110 (see the footnote to that section). They are 
given by 

(Lym1| Yrm| leme) 


= (= 1min h i A l “4 y 
-m m mjJ\O 0 0 


D pee 1)(2h, + 1)(2le + yy (107.14) 
A 


In particular, for m, = mz = m = 0 we find the integral of the product of 
three Legendre polynomials: 


1 


| POP.OdPuO) du = 2() es (107.15) 


ae 


§108. 6j;-symbols 


In §106 we have defined 3j-symbols as the coefficients in the sum (106.4) 
which represents the wave function of a system of three particles with zero 
total angular momentum. As regards the transformation properties under 
rotations, this sum is a scalar. Hence it follows that the set of 3j-symbols 
with given values of jı, J2, js (and all possible mı, m2, mg) may be regarded as 
a set of quantities which are transformed under rotations according to a law 
contragredient to that for the products Wy m,bj,m,¥j,m, SO that the sum as a 
whole is invariant. 

From this viewpoint we may put the problem of constructing a scalar 
consisting of 3j-symbols only. This scalar must depend only on the numbers 
j, and not on the numbers m, which are altered by rotations. In other words, 
it must be expressible in terms of sums over all the numbers m. Each such 
sum consists of a “contraction” of the product of two 3j-symbols according 


to the formula 
a EEN ret 


cf. the acme of constructing the scalar (106.2). 
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Since in each “‘contraction’”’ a pair of numbers m is involved, we must 
consider products of an even number of 3j-symbols in constructing the 
complete scalar. The contraction of the product of two 3j-symbols gives, 
owing to their orthogonality, the trivial result 


> e j2 at Ji Je 7) aa 
mı mo ms — Mı -mə -m3 


M Ma3 
Jt J2 ja Ņ\e 
my, Mə? M3 


Mi M2 Ma 


here we have used the equation mı +mz2+ m3 = 0 and formulae (106.6) and 
(106.12). The smallest number of factors needed to form a non-trivial scalar 
is therefore four. 

In each 3j-symbol the three numbers ; form a closed triangle. Since 
each number j must appear in the “contraction” in two 3j-symbols, it is clear 





Fic. 45 


that in the construction of a scalar from the products of four 37-symbols 
there are six numbers j forming the edges of an irregular tetrahedron (Fig. 45), 
one face of which corresponds: to each 3j-symbol. In defining the required 
scalar it is customary to use a certain condition as regards the contraction 
process, given by the formula 


a J2 A = > (=D Edema ( ee a x 
Ja Js Je —m, —m2 —m3 


all m 


z la J5 oe Je "N ja J5 on (108.2) 

mı —ms me/ \ma mo — me —mMa MS m3 
The summation here is over all possible values of all the numbers m; however, 
since the sum of the three m in every 3j-symbol must be zero, only three of the 


Six m are in fact independent. The quantities defined by the formula (108.2) 
are called 6j-symbols or Racah coefficients.+ 


+ The notation 
Wijefajsiisje) = (— i e PEL ja 
A E Ja 45 J6 
is also used tn the literature. 
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From the definition (108.2), using the symmetry properties of the 37- 
symbols, we easily see that a 6j-symbol is unchanged by any permutation of 
its three columns, and in any pair of columns the two numbers can be 
simultaneously interchanged. Owing to these symmetry properties, the 
sequence of numbers ji, .. ,j6 in the 6j-symbol can be put in 24 equivalent 
forms.t In addition, the 6j-symbols have another, less evident, symmetry 
property which states an equality between symbols with different sets of 
numbers 7:} 

- Je "| 7 A 2(jetjs+ja—Je) 43 +76 +2 Ee (108.3) 
ja Js jo \Ja 2jatjs+Je—Js) $03 +je+js—J2) 


We may mention a useful relation between the 6j- and 3j-symbols which can 
be derived from the definition (108.2): 


> (—1)fetdotde—me—ms~mme - 15 A 
Mı —mMms5s meg 


MaMe Ma 
«(* j2 ms ja J5 x z f J2 do raen (108.4) 
Ma m2 —Me/ \— Ma M5 M3 mı m2 M3/ \Ja J5 Je 

The expression which is summed on the left-hand side of the equation differs 
from that in (108.2) by the absence of one 3j-symbol. We can therefore say 
that the sum in (108.4) is represented by the tetrahedron (Fig. 45) without 
one of its faces; this determines the difference of the sum from a scalar. 
In other words, as regards transformation properties it corresponds to one 
37-symbol, the one on the right-hand side of equation (108.4), to which it 
must be proportional. The proportionality coefhicient (the 67-symbol on the 
right-hand side of the equation) is easily found by multiplying both sides by 


(2 je J3 
mı mo M3 
and summing over the remaining numbers m, mg, msg. 

The 6j-symbols arise naturally in connection with the following problem 
concerning the addition of three angular momenta. 

Let three angular momenta jy, j2, j3 be added to give a resultant angular 
momentum J. If the value of J (and of its component M) is given, the state 
of the system is not yet uniquely determined, but depends also on the manner 
of addition of the angular momenta (or, as we say, on their coupling scheme). 

For example, let us consider two such coupling schemes: (1) first the 
angular momenta jı and je are added to give a total angular momentum J12, 
and then jiz and jz are added to give the final angular momentum J; (2) the 


+ If we regard the tetrahedron in Fig. 45 as being regular, the 24 equivalent permutations of 
the numbers j can be obtained by means of the 2+ symmetry transformations (rotations and 
refiections) of the tetrahedron. 

t See T. Regge, I] nuovo cimento [10] 10, 544, 1958; 11, 116, 1959. 
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angular momenta jz and Ja are added to give je3, and then jg3 and ji to give J, 
The former scheme corresponds to states in which the quantity J12 (as well 
as ji, J2, J3, J, M) has a definite value; their wave functions will be denoted 
by Y; „Jm (omitting for brevity the repeated suffixes Ji, Jz, j3). Similarly, the 
wave functions of the second coupling scheme are denoted by Wy JM. 
In both cases the values of the “intermediate” angular momentum ( J12 Or 
J23) are in general not unique, so that (for given J and M ) we have two different 
sets of states differing in the values of j12 or jo3. According to the general 
rules, functions of these two sets are related by a certain unitary transfor- 
mation: 


Viam = È Cfreljes>by,.Jm.- (108.5) 


It is evident from physical considerations that the coefficients in this 
transformation are independent of the number M: they must be independent 
of the orientation of the whole system in space. Thus they depend only on 
the values of the six angular momenta ji, J2, J3, J12, J23, J, not on their com- 
ponents, i.e. are scalar quantities (in the sense defined above). The actual 
calculation of these coefficients is easily effected as follows. 

By a repeated application of formula (106.9) we find 


bj JM = 2. <mymes| JM >b7.mb7,m,, 


2 <mimes| JM > <mem3| jz3m23X p,m pi m him, 
bj JM = 2 (mgmy2| JM) <myme|jiemie> p; m pim Pim, 
where (m) denotes that the summation is over all numbers mı, Mo, ... that 


appear in the expression. From the orthonormality of the functions jm we 
have 


(12 j23> 


Il 


f by,uar* hy sm dg 
2 (mgm el JM) <myme3| JM) <mymo|j12mi2)> <memaljogmeg). 


The sum on the right-hand side is taken for a fixed M, but the result is 
actually independent of W (for the reason already mentioned). The sum- 
mation can therefore be extended over the values of M if the sum is multiplied 
by a factor 1/(2J+1). Expressing the coefficients <myme| jm» in terms of 
3j-symbols by (106.10), we obtain the following expression: 


jiel joa) = (— IHI 2j 1)(oat D” F | (108.6) 
Ja J jeg 


The relation between the 6j-symbols and the transformation coefficients in 


(108.5) makes it easy to derive some useful formulae for the sums of products 
of 6j-symbols. 
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First of all, since the transformation (108.5) is unitary and its coefficients 
are real, the relation 
TEL a je] ! 


Sger fe eye ee 
j Ji Jad 


: = OF (108.7) 
J3 Jaj 


holds. 

Next, let us consider the three coupling schemes of three angular momenta, 
with intermediate sums jie, jeg and j31 respectively. The coefficients (108.6) 
of the corresponding transformations are related, according to the matrix 
multiplication rule, by 


2 <jreljes><jesljai> = <Jreljsi>- 


Substituting here (108.6) and renumbering the sufhxes we have 
Senega E AE] PAL (108.8) 
j Jı JS J 72 Js PB Ja J5 J6 
Finally, by considering the various coupling schemes of four angular 


momenta, we can derive} the following addition formula for the products of 
three 6j7-symbols: 
Seyegan ft ee 
5 jo je iG Ngo i Gol ie j 
= (PAEA) ioga) 
Ja JS Je! \J? J8 J9 


(L. C. Biedenharn, and J. P. Elliott; 1953). 

For reference we shall give some explicit formulae for the 6j-symbols. 
In the general case, a 6j-symbol can be written as the following sum: 
fi j2 j3 E eee ee 
| i = A(jrjojs)A(jriste) A Gaze) A (4573) x 


ja J5 Je 





5 (—1}(z+1)! 
x ae oo ee nn a a ee eee eg ee 
(2—j1—jo—js) (2 —j1—js—Je) (2 —ja—je—Je) (2 —Ja—'5 —J3): 


Z 


x 


1 
a Ea a aa ee 


(jitjetjatis—2)\jetististje—2)\(jatsitjetsi— 2)! 
where 
a+b—c)(a—b+c)\(—a+b+c)! T 
alao = [- )( ut ~ | 
(a+b+c+1)! 


and the sum is taken overall positive integers for which none of the factorials 
in the denominator has a negative argument. 


+ See Edmonds’s book quoted 1n §106. 
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Table 10 gives the values of the 6j-symbols for cases where one of the 
parameters is 0, 4 or 1. 

Finally, we shall make a few remarks concerning the higher-order scalars 
constructed from the 3j-symbols. 

The next in complexity after the 6j-symbols is a scalar formed by contract- 
ing products of six 3j-symbols. These 3j-symbols contain 18 numbers j 
equal in pairs, and so the resulting scalar depends on 9 parameters j. It 
is called a 9j-symbol and is defined as followst (E. P. Wigner 1951): 


TABLE 10 
Formulae for 6j-symbols 
PNG OEE a 
local vaen PR 
ja b c yas (s—2b)(s—2¢+1) 1/2 
lve) cee 5 Tee ee 


ab te : (s+ 1)(s—2a) 1/2 
l é oil oom Eo a 





ja b c _ =y s(s + 1)(s —2a —1)(s — 2a) 1/2 
Te oe (2b—1)2b(2 + 1)(2e — 1)2c(Ze + 1) 

ja b c |= Szi 2(s+1)(s—2a)(s — 2b{s~2¢ +1) q1? 
ltc-1 b 2b(2b+ 1)(2b + 2)(2e— 1)2c(2e+ 1) 

ja b c |= E ee 12 
SETANTA (2b + 1)(2b +2)(2b + 3)(2e— 1)2c(2e+ 1) 

ja b c ie 2[b(6 + 1) +.e(c+ 1)—a(a+1)] 

li c b [2b(2b + 1)(2b +.2)2c(2e + 1)(2c +2)}12 


Jin J12 J13 


STS Jur J2 Jia\ f jer joe Jes\ /J31 J32 J33 
J21 J22 J23} = x 
Mii M2 may M22 M23 


: l f 113 77131 M32 M33 
J31 J32 J33 all m 


i Ji w J22 Ee J23 pa (108.11) 


711 Pio, 3] mlio Plon Ngo 213 723 M33 


This quantity can also be written as the sum of products of three 6j-symbols: 


Jur fiz fis So, Oo 
aie, P 1 
is Da ja "a > Dazji d pe ote am ms ee 7) (108.12) 
a ae J32 933 J 7\J21 J jeayt\j J1 fiz 

J31 J32 J33 j 


t According to the general rule of contraction (108.1) it would be necessary to write the 
arguments m in the last three 3j-symbols with the minus sign and to include in the summand 
a factor (—1)29-™), However, by using the property (106.6) of the 3j-symbols and the fact 


that in this case, as is easily seen, the sum 5m of all the nine numbers m is zero, we have the 
definition (108.11). 
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The equivalence of (108.11) and (108.12) can be seen by substituting in 
(108.12) the definition (108.2) and using the orthogonality properties of 
the 3j-symbols. 

The 9j-symbol has a high degree of symmetry, which follows directly from 
the definition (108.11) and the symmetry properties of the 3j-symbols. It 1s 
easily seen that when any two rows or columns are interchanged the 9j-symbol 
is multiplied by (—1)*7. Moreover, the 9j-symbol is unaltered by transposi- 
tion, i.e. interchange of rows and columns. 

Scalars of still higher orders depend on a still larger number of parameters 
j. It is evident that this number will always be a multiple of three (3n- 
symbols). We shall not pause to discuss their properties, but merely mention 
that for every n > 3 there is more than one type of 37j-symbol, and these 
do not reduce to one another. For example, there are two different types of 
12j-symbol.+ 


§109. Matrix elements for addition of angular momenta 


Let us again consider a system consisting of two parts (referred to as sub- 
systems 1 and 2), and let fkọ® be a spherical tensor pertaining to sub-system 
1. Its matrix elements with respect to the wave functions of this sub-system 
are given, According to (107.6), by the formula 


jy Rj me l 
; )ema‘j | fln). 
—m, g mı 
(109.1) 


The question arises of calculating the matrix elements of these quantities 
with respect to the wave functions of the system as a whole. We shall show 
how they may be expressed in terms of the same reduced matrix elements 
as appear in the expression (109.1). 

The states of the system as a whole are defined by the quantum numbers 
jis jo J, M, ni, ne (where J and M are the angular momentum and its com- 
ponent for the whole system). Since fkq™ refers to sub-system 1, its operator 
commutes with the angular momentum operator of sub-system 2. Its matrix 
is therefore diagonal with respect to J2; it is also diagonal with respect to the 
remaining quantum numbers me of this sub-system. These indices j2, 2 will 
be omitted, for brevity, and the required matrix elements will be written as 


nija JM | frq@impnJ™M). 
According to (107.6), their dependence on the number M is given by 


Caen agin = (= 1) ( 


i 


FES 
nyja JM" | frao®mjJMy =i- a (— M'g hJ {fP >. 
(109.2) 


+ A more detailed account of the theory of 9j-symbols and of the properties of 3nj-symbols 
is given in Edmonds’s book quoted in §106 and by A. P. Yutsis, 1. B. Levinson and V. V. 
Vanagas, Mathematical Apparatus of the Theory of Angular Momentum, Oldbourne Press, 
London 1963 (Matematicheskii apparat teorii momenta kolichestva dvizhentya, Vilnius 1960). 
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To establish the relation between the reduced matrix elements on the right- 


hand sides of (109.1) and (109.2) we write, using the definition of the matrix 
elements, 


Cnrj JM’ franj JM = f bam *feg™ bam dq 


a a ae 
= > (SM AIh”, ) x 
Mimi S PR 


ta J 
mı mgo — M 

Substituting here (109.1) and (109.2) and comparing the resulting relation 
with formula (108.4), we see that the ratio of the reduced matrix elements in 
(109.1) and (109.2) must be proportional to a certain 6j-symbol. A careful 
comparison of the two relations mentioned leads to the final formula 


Cm J Jea EA enta /[(2I+ 12S" + 1)] x 


fu’ Bi Je oy l 

xP hemil fe tmjay, (109.3) 
J Ji k 

where J1 max iS the greater of j1, ji’, and Jmin the smaller of J, J’. A similar 

formula for the reduced matrix elements of the spherical tensor pertaining 

to the second sub-system is 


(ngja J| fr neja Jy =(— 1) timet maxtha/[((2J + 1)(2I° + 1)] x 
a FJ 


knj"! fr®|nej2>. (109.4) 
J j2 k 


The lack of complete symmetry between the expressions (109.3) and (109.4) 
(in the exponent of —1) is due to the dependence of the phase of the wave 
functions on the order of addition of the angular momenta. The difference 
must be borne in mind when calculating matrix elements for both sub-systems 
simultaneously. 

We shall also derive a useful formula for the matrix elements, with respect 
to the wave functions of the whole system, of a scalar product (see the defini- 
tion (107.4)) of two spherical tensors of the same rank & pertaining to different 
sub-systems (and therefore commuting). According to (107.10), these matrix 
elements are given in terms of the reduced matrix elements of each tensor 
(with respect to the wave functions of the whole system) by 


Lnr negija JMI fr fe ool minajije J MY 
1 
= JA È MHJ fedoras J nje J fama ja] 
Z 


where we have used the fact that the matrix of a quantity pertaining to one 
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sub-system is diagonal with respect to the quantum numbers of the other 
sub-system. Substituting (109.3) and (109.4) and using the summation 
formula (108.8), we obtain the desired formula expressing the matrix elements 
of the scalar product in terms of the reduced matrix elements of each tensor 
with respect to the wave functions of the corresponding sub-systems: 


<ny'n9'fr jo JMI fx fe oolminaj rye JM 
J je’ qv 


— (= iest trond] j 
ki je 


Ikan fr iniji langel fr |ingje>. (109.5) 


§110. Matrix elements for axially symmetric systems 

The basis for calculating the matrix elements of quantities pertaining to 
systems of the symmetrical-top type is the expression for the integral of the 
product of three D functions. 

To derive this, we return to the expansion (106.11): 


Pim Dim, = 3 <jm|myma> Phim, m = mı + mə, 
} 


and transform both sides by a finite rotation of the coordinates. Each of the 
functions W is transformed according to (58.7), so that we have 


M IM" 


y DY), Dee, Pim Pim, = ~ sy <jm|mym2>D2 ab jm:. 
J m 


Now, expressing the functions jm. on the right by means of the expansion 
(106.9) and comparing the coefficients of the respective products $j m Pim, 
we find the relations 


DS? pL)? L) = Y mame ljm D2a(a)<omame jm), (110.1) 
: 

where m = mı +m, m' = my' + mg’, and w denotes the set of three Eulerian 

angles a, 8, y. Expressed in terms of 3j-symbols, this formula becomes 


Do" m (o) Dam (0) 


= > +n( j2 J \C je J D2, mle) (110.2) 


mi mo —m'/\m, m —m 


here we have also used the property (58.19) of the D functions. . 
Multiplying both sides of equation (110.2) by DY,,.,_,,(w) and integrating 
with respect to w by means of the orthogonality relation (58.20), we have 


| Disa (Din (es) D m gaa eB) ® ®): ame 
WT 


m; mo mg /\m, m2 m3 


here the indices have been renamed in an obvious manner, in order to make 
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the result more symmetrical. This ıs the required formula.t 

Let fxg: be a spherical tensor of rank k pertaining to a top, in coordinates 
x’, v, zZ = &,n, ¢ fixed to the top (with ¢ along the axis): for example, the 
electric or magnetic multipole moment tensor. Let fxq be the components of 
the same tensor relative to the fixed coordinates x, v, z. The relation between 
these is given by the matrix of finite rotations: 


fra = © Dora (110.4) 


‘The wave functions describing the rotation of the system as a whole differ 
from the D functions only as regards normalization: 


Hime = i SD te) (110.5) 
where j is the total angular momentum of the system, 7 its component along 
the fixed z-axis, p its component along the axis of the system: the phase 
factor is chosen so that, for integral; and p = Q, the function (110.5) becomes 
the eigenfunction of the free angular momentum (cf. (103.8)). On calculating 
with respect to these functions the matrix element of the quantity (110.4) by 
means of formula (110.3), and expressing the complex conjugate D function 
by means of (58.19), we find 


{pm | feqljum> 
= g-i (— 1) m V/[(27 + 1)(27' + 1)] x 


x ( es es 7) self: (110.6) 


-p' g p m 


here g' = p’—p,g = mm. 

This formula gives the solution of the problem proposed, expressing the 
dependence of the matrix elements on the angular momenta J, J’ and their 
components m, m’. The dependence on the quantum numbers p, p’ is, of 
course, indeterminate; their values are related to the “internal” states of the 
system, between which the “‘internal” matrix element <p'| fro |p) 18 taken. 

The dependence of the matrix elements (110.6) on the numbers m, m is, of 
course, of the same kind as for any system with a given total angular momen- 
tum. Separating out this dependence by using the reduced matrix elements 
according to (107.6), we obtain for the latter the expression 


Te i Sel Je 
scp = " ae E | : 
= Mayme" i+ 1yays= (1 \ew'l feel. (110.7) 
hog ps 
t For integral values ofj, = l,j = kand j, = lh, and mm,’ = M? = M, = Q, the functions 


Dae reduce, according to (58.25), to spherical harmonic functions. and formula (110.3) gives 
an expression for the integral of a product of three such functions (107.14). 
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The squared modulus of the matrix element (110.6), summed over all 
values of the final number m’ (and over g = m’ — m) for a given m, is indepen- 
dent of the value of m and equal, by the general rule (107.11), to 


Í ' í 
ye t 9 = CARS A 1 9 
2 Gum fedje)? =z J'e find 


"o : "elfele. (110:8) 
ji! g“ 


The Hermitian relations (107.9) for the reduced matrix elements in the 
coordinates x, y, z (110.7) are, as we should expect, in agreement with the 
relations (107.8) 


<ul frale) = (-1)F-© Cul fr, -alp )* 


for the matrix elements in the coordinates £, n, ¢. 

The rotation of such axially symmetric systems as a diatomic molecule (or 
an axial nucleus) is described by only two angles (a = $, B = 8), which 
define the direction of the axis of the system. The rotational wave function 
differs in this case from (110.5) by the absence of the factor e'#y/+/(27); cf. 
the second footnote to §82. This difference, however, does not affect the 
matrix elements: since the dependence of the functions DY) (a, B, y) on y 
is represented by the factor e*” y, formula (110.3) can be written as 


Ôm 0 | Dink % B, OD % By 0) Dim% Bs one Tereg 


-h k 1) j jz a 
mı m2 m3/\m1 me m3 


where m’ = my'+mo' +mg'; the result of calculating the integral is un- 
changed. The selection rule for the axial component of the angular momen- 
tum is the same as before (p'—p = q’), resulting from the orthogonality of 
the electron wave functions (because of the symmetry of the molecule about 
the č-axis). In formulae (110.6) and (110.7), <u'| fral) must now be 
understood as the matrix elements with respect to the electron states for 
nuclei at rest. 


CHAPTER XV 


MOTION IN A MAGNETIC FIELD 


§111. Schrodinger’s equation in a magnetic field 


A PARTICLE that has a spin also has a certain “intrinsic” magnetic moment p. 
The corresponding quantum-mechanical operator ts proportional to the spin 
operator $, and can therefore be written as 


A = pôjs, (111.1) 


where s is the magnitude of the particle spin and p a constant characterizing 
the particle. The eigenvalues of the magnetic moment component are 
uz = pols. Hence we see that the coefficient u (which ts usually called just 
the magnitude of the magnetic moment) is the maximum possible value of 
Hz, reached when the spin component o = s. 

The ratio p/hs gives the ratio of the intrinsic magnetic moment and the 
intrinsic angular momentum of the particle (when both are along the z-axis). 
For the ordinary (orbital) angular momentum, this ratio is e/2mc (see Fields, 
§44). The coefficient of proportionality between the intrinsic magnetic 
moment and the spin of the particle is not the same. For an electron it is 
—|e|/mc, i.e. twice the usual value, as is found theoretically from Dirac’s 
relativistic wave equation (see ROT, §33). The intrinsic magnetic moment 
of the electron (spin $) is consequently — zg, where 


ug = |e|i/2mc = 0-927 x 10-20 erg/gauss. (111.2) 


This quantity is called the Bohr magneton. 

The magnetic moment of heavy particles is customarily measured in 
nuclear magnetons, defined as eh/2myc, with mp the mass of the proton. The 
intrinsic magnetic moment of the proton is found by experiment to be 2-79 
nuclear magnetons, the moment being parallel to the spin. The magnetic 
moment of the neutron is opposite to the spin, and is 1-91 nuclear magneton. 

It should be noted that the quantities and s on the two sides of (111.1) 
are the same type of vector, as they should be: both are axial vectors. 
A similar equation for the electric dipole moment d (= constant x s) would 
contradict the symmetry under inversion of the coordinates: the relative sign 
of the two sides would be changed by inversion.+ 


t Such an equation (and therefore the existence of an electric moment of an elementary 
particle) would also contradict the symmetry under time reversal: a change in the sign of the 
time does not alter d, but does change the sign of the spin (as is evident, for example, from 
the definitions of these quantities in orbital motion, that of d involving only the pe R 
whereas that of the angular momentum also involves the velocity of the particle). i 
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In non-relativistic quantum mechanics, the magnetic field may be regarded 
as an external field only. The magnetic interaction between the particles is a 
relativistic effect, and a consistent relativistic theory is needed to take it into 
account. 

In the classical theory, the Hamilton’s function of a charged particle in an 
electromagnetic field is 


where ¢ is the scalar and A the vector potential of the field, and p the general- 
ized momentum of the particle; see Fields, §16. If the particle has no spin, 
the transition to quantum mechanics can be made in the usual manner: the 
generalized momentum must be replaced by the operator p = —zAV, and 
we obtain the Hamiltoniant 


n i A € 2 
Fie 5,(8-£A) a ei (111.3) 
C 


If, on the other hand, the particle has a spin, this procedure does not sufħce. 
This is because the intrinsic magnetic moment of the particle interacts 
directly with the magnetic field. In the classical Hamilton’s function, this 
interaction does not appear, since the spin, which is a purely quantum effect, 
vanishes in the limit of classical mechanics. The correct expression for the 
Hamiltonian is obtained by including in (111.3)an extra term—y. H corres- 
ponding to the energy of the :nagnetic moment yp in the field H. Thus the 
Hamiltonian of a particle having a spin is] 


A 2 
ne Fa P-£A) eee (111.4) 
2m c 


In expanding the square (6B —eA/c)*, we must bear in mind that p does not 
in general commute with the vector A, which is a function of the 
coordinates. Hence we must write 


HA = p2/2m—(e/2mc)(A P+ p -A)+e2A2/2me2—(y/s)8. H+ U. (111.5) 


According to the rule (16.4) for the commutationof the momentum operator 
with any function of the coordinates, we have 


p.A—A.p = —ih divA. (111.6) 


+ The generalized momentum is here denoted by the same letter p as the ordinary momen- 
tum (and not by P as in Fields, §16), in order to emphasize that it corresponds to the same 
operator. 

t There should be no misunderstanding here caused by the use of the same letter for the 
field and the Harniltonian, since the latter always has a circumflex over it. 
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Thus p and A commute if div A = 0. This holds, in particular, for a 
uniform field, if its vector potential 1s expressed in the form 


A = }Hxr. (111.7) 


The equation zhé¥ êt = HY with the Hamiltonian (111.4) is a generaliza- 
tion of Schrédinger’s equation to the case where a magnetic field is present. 
The wave functions on which the Hamiltonian acts in this equation are 
symmetrical spinors of rank 2s. 

The wave functions of a particle in an electromagnetic field are not 
uniquely defined, because the choice of the field potentials is not unique: they 
are defined (see Fields, §18) only to within a gauge transformation 


Peed ON E -2% (111.8) 
Cc 


where f is an arbitrary function of the coordinates and the time. This trans- 
formation does not affect the values of the field strengths, and it is therefore 
clear that it cannot essentially alter the solutions of the wave equation; in 
particular, it must leave |F|? unchanged, since it is easy to see that the original 
equation is restored if we make the changes (111.8) in the Hamiltonian and 
at the same time change the wave function according to 


F >Y exp (zef/iic). (1119) 


This non-uniqueness of the wave function does not affect any quantity 
having a physical significance (in whose definition the potentials do not 
appear explicitly). 

In classical mechanics, the generalized momentum of a particle is related to 
its velocity by 


my = p—eA/c. 


In order to find the operator ¢ in quantum mechanics, we have to commute 
the vector r with the Hamiltonian. A simple calculation gives the result 
my = p—eA/c, (111.10) 


which is exactly analogous to the classical expression. For the operators of 
the velocity components we have the commutation rules 


{dz, y} = i(eh/m*c)Hz, 
{óy Oz} = i(eh/m2c)H,, (111.11) 
{fz Gz} = i(eh/m*c)H,, 


which are easily verified directly. We see that, in a magnetic field, the 
operators of the three velocity components of a (charged) particle do not 
commute. This means that the particle cannot simultaneously have definite 
values of the velocity components in all three directions. ` 
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In motion in a magnetic field, the symmetry with respect to time reversal 
occurs only if the sign of the field H (and of the vector potential A) is changed. 
This means (see §§18 and 60) that Schrédinger’s equation Ay = Ey must 
keep the same form when we take complex conjugates and change the sign of 
H. This is immediately evident for all terms in the Hamiltonian (111.4) 
except — §.H. The term — §.Hy in Schrédinger’s equation becomes 
§* . H* under the transformation in question, and at first sight this destroys 
the required invariance, since the operator §* is not the same as —§. It must 
be remembered, however, that the wave function is in reality a contravariant 
spinor y^™ =, which has the complex conjugate y*** transformed as a 
covariant spinor (see §60). The spinor yžą„... is contravariant. Using 
(57.4) and (57.5) to find the components (§.Hy)* and expressing them in 
terms of Yt, , we can see that under time reversal a Schrédinger’s 
equation for the YX, is obtained which has the same form as the original 
equation for the y“*~. 


§112. Motion in a uniform magnetic field 


Let us determine the energy levels of a particle in a constant uniform 
magnetic field (L. D. Landau 1930). The vector potential of the uniform 
field is conveniently taken here not in the form (111.7), but as 


wS Fig, Ay = A, =0 (112.1) 


(the z-axis being taken in the direction of the field} 
The Hamiltonian then becomes 


ee p 
H = È Camne e E s H. (112.2) 
2m 2m 2m ; 

First of all, we notice that the operator fz commutes with the Hamiltonian, 
since the latter does not contain the operators of the other components of the 
spin. This means that the z-component of the spin is conserved, and there- 
fore that §, can be replaced by the eigenvalue sz = o. Then the spin depen- 
dence of the wave function becomes unimportant, and ys in Schrödinger’s 
equation can be taken as the ordinary coordinate function. For this function 
we have the equation 


H| (2+ Ey) +But +82? yor = By. (112.3) 
2m c s 


The Hamiltonian of this equation does not contain the coordinates x and z 
explicitly. The operators pz and fz (of differentiation with respect to x and 2) 
therefore also commute with the Hamiltonian, i.e. the x and z components of 
the generalized momentum are conserved. We accordingly seek 4 in the form 


Y = elt IANprz+P:2y( y). (1 12.4) 
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The eigenvalues pz and pz take all values from — œ to + œ. Since Az = Q, 
the z-component of the generalized momentum is equal to the ordinary 
momentum component mvz. Thus the velocity of the particle in the direction 
of the field can take any value; we can say that the motion along the field is 
“not quantized”. 

Substituting (112.4) in (112.3), we obtain the following equation for the 
function x(y): 


9 
x +B] (E+ MH BE) -pman fx = 0, (1125) 
h? s 2m 
with the notation yo = —cpz/eH and 
wH = |e Hmc. (112.6) 


Equation (112.5) is formally identical with Schrédinger’s equation (23.6) for 
a linear oscillator, oscillating with frequency wg. Hence we can conclude 
immediately that the expression in round brackets in (112.5), which takes the 
part of the oscillator energy, can have the values (n + 43)iwy, where n = 
Dal 2 ae 

Thus we obtain the following expression for the energy levels of a particle 
in a uniform magnetic field: 


E = (n+4})iwy + p22/2m—poH|s. (112.7) 


The first term here gives the discrete energy values corresponding to motion 
in a plane perpendicular to the field; they are called Landau levels. For an 
electron, pjs = —|el|k/mc, and formula (112.7) becomes 


E = (n+4}+o)hwy + p2?/2m. (112.8) 


The eigenfunctions x,(y) corresponding to the energy levels (112.7) are 
given by (23.12) with the appropriate changes of notation: 


Í Em y— yo 
Xn(y) a. eat Dag of Ga ) ( ) 


whereay = 4/(h/mwy). 

In classical mechanics, the motion of particles in a plane perpendicular to 
the field H (the xy-plane) takes place in a circle about a fixed centre. The 
quantity yo, which ts conserved in the quantum case, corresponds to the 
classical y coordinate of the centre of the circle. The quantity xo = cpyeH +x 
is also conserved, it is easy to see that its operator commutes with the 
Hamiltonian (112.2). This quantity xy corresponds to the classical x co- 
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ordinate of the centre of the circle.t The operators £9 and Jo, however, do 
not commute. In other words, the coordinates xo and yo cannot take definite 
values simultaneously. 

Since (112.7) does not contain the quantity pz, which assumes a continuous 
sequence of values, the energy levels are continuously degenerate. However, 
the degree of degeneracy becomes finite if the motion in the xy-plane is 
restricted to a large, but finite, area S =L,L,. The number of (now 
discrete) possible values of p, in an interval Ap, is (L,/27h)Ap,. All values 
of p, are admissible for which the orbit centre is inside S (we neglect the 
radius of the orbit in comparison with the large quantity L,). From the con- 
dition 0 < vo < Ly we have Ap; = eHL,/c. Hence the number of states 
(for given 7,pz) is elt S[2nhe. If the region of motion is bounded in the 
z-direction also (dimension Lz), the number of possible values of pz in 
an interval Ap; is (L,/27h)Ap; and the number of states in this interval is 


eHS Lz, _ eHVAp, 
ahan ~~ 4 ®h®e 





(112.10) 


For an electron there is an additional degeneracy: the levels (112.8) with 
n,o = 4andn+1,o = —} are the same. 


PROBLEMS 
PROBLEM 1. Find the wave functions of an electron in a uniform magnetic field in states 
in which it has definite values of the momentum and angular momentum in the direction of 
the field. 


SoLuTION. In cylindrical polar coordinates p, ¢, z with the s-axi« .n the direction of the 
field, the vector potential has components Ag = 4Hp, Az= Ap =,0, and Schrödinger’s 
equation isf 





KPa aby Ay 1 eh) ipa, ay ; 
mye —Nihwy L H Mwg ph = Eb 1 
aM ; A An ap? z 72 — th toe wH P (1) 


We seek a solution in the form 


l 
D Rip)eimeeip zih, 
Y On) (p) 2 





obtaining for the radial function the equation 


2 i 2 2 - 
al Fricke = + |£- Boa Mwg’ p? —- bhy |R = 0. 
2M m 2M 


+ For, in classical motion in a circle of radius cmvt/eH (where vz is the projection of the 
velocity on the xy-plane; see Fields, §21), we have 
Yo = —eprleH = —emvuzleH +y. 
It is evident from this that yọ is the y coordinate of the centre of the circle. The other co- 
ordinate is 
xo = emvyfeH +x = ChyleH +x. 
t The electron charge is written as e = —Je|, and the electron mass iS denoted by M to 
distinguish it from the angular momentum m. The spin term is unimportant in this problem, 
and is omitted. 
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Defining a new independent variable £ = (Mwn/2h)p?, we can write this equation in the form 
mè 
ER” +R + (-16+2- =)R =0, 


B = Tog EF) i 


hwg 


As € > © the required function behaves as e-#, and for E — 0 as Elmi/2, Accordingly we 
seek a solution in the form 


R(E) = e-€/2gI 2); 
the equation for w(£) is satisfied by the confluent hypergeometric function 


w = F{—(B—$3|m|—}), |m| +1, £}. 


If the wave function is everywhere finite, the quantity B—4}m|—4 must be a non-negative 
integer mp. The energy levels are then given by the formula 


2 
E = ot Sen + imt), 
wyna + mj? 5) aM 


which is equivalent to (112.7). The corresponding radial wave functions are 


| 17172 2 
Rapml o) = ee at | a | exp ( = an x 


ap! m| ! 2imel yy! 4a;;27 


x F(—n,, m| +1, p? 2a’), (2) 


where ay = V (lM wn); these are normalized by the condition 


Re de a 


v 


The hypergeometric function is here a generalized Laguerre polynomial. 


PROBLEM 2. Find the lowest energy level corresponding to a bound state of an electron in 
a potential well U(r) of small depth (U| <A?/ma?, where a is the range of the forces in the 
well), with a uniform magnetic field superimposed (Yu. A. Bychkov 1960). 


SOLUTION. The condition stated for the field U(r) ensures (in the absence of the magnetic 
field) that perturbation theory is applicable to it, and there are no bound states in the well 
(§45). When the magnetic field also is present, U(r) can be regarded as a perturbation only 
for the motion in the plane transverse to H; the (discrete) nature of the energy spectrum of 
this motion is unaltered when U is applied. The nature of the motion in the direction of H is 
altered, however, becoming (as we shall see) finite instead of infinite, i.e. the spectrum 
becomes discrete instead of continuous. Thus for this motion the field of the well cannot be 
treated by perturbation theory. 

Accordingly, when the variables are separated in Schrödinger’s equation (equation (1) of 
Problem 1, with the added term Ud on the left), the radial functions R{p) are taken ia the 
previous form (2); the lowest level corresponds to the values mp = m = 0 of the quantum 
numbers. Substituting in Schrédinger’s equation % = Roo(p)x(z), multiplying by Roo( p) 
and integrating over pdp, we obtain for x(z) the equation 


he os = 
Bes T C(s)x = €x, (3) 
where « = E—}hu, 


Cia) = | LEVE + PRR p) od. 
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and m is again the mass of the particle. This equation has the same form as Schrédinger’s 
equation for one-dimensional motion in a potential well U(z), with e the energy of this motion. 


We can therefore simply use the result of §45, Problem 1, according to which the discrete 
level is 


= - oa f | L(y (22 + p*)) Roo p)p dp az (4) 


The wave function Roo( p) is damped at distances p~ an. If the magnetic field is so weak 
that aya, the integral over p is governed by the range p <a, in which we can take Roof P) 
se Ro (0) = 1/an. Then 

me?H? f [ ,. -\e 
= = m- Uir}dl ; 
: ac ( | 4 ) ©) 
here dV = 27 p dp dz—74nr7dr. In the opposite case of a strong magnetic field, when an <a, 


the integral in (4) is governed by the range p < 4H, in which we can put U(4/(z? + p°) œ U2). 
Then the integral over p reduces to the normalization integral of the function Roo and is 


equal to unity, so that 
ee -aef Í Ul) ds)" (6) 
i= 


In either case, an estimate of the integral shows that e<hwn. 


PROBLEM 3. Find the energy levels of a hydrogen atom in a magnetic field that 1s so strong 
that ay ap, where ag is the Bohr radius (R. J. Elliott and R. Loudon 1960). 


SoLUTION. With the condition stated, Awy > me*/h?, the influence of the Coulomb field 
of the nucleus on the motion of the electron in the plane transverse to H may be regarded as a 
small perturbaticn. We thus return to the situation considered in Problem 2, and equation (3) 
may be used, with 


an 


is | Rool p)P 
= _ 2 ee 
U(z) = e VA z AP (7) 


v 


By writing the radial function Roo in this expression, we take only the energy levels of the 
longitudinal motion, pertaining to the zero Landau level (wp) of the transverse motion. | 

The ground-state wave function Xo(z) extends to a distance |z| sag and varies slowly over 
this distance (without zeros, so that ıt does not vanish at z = 0). Hence the ground level 
satisfies the conditions used in §45, Problem 1, and we can use formula (6), which is based 
on the solution of that problem. The logarithmically divergent integral is ‘‘cut off” at an 
upper limit of distance |z|~ ag and at a lower limit |z|~ an (where |2|~ p and it is not per- 
missible to replace +/(p*+ 27) by |z| in (7)). The result is 


Z2me4 a 
€9 = Bare log? 2B 


ay 
me’ 


h8H 
2h? e m?c|el’ 


This formula has what is called logarithmic accuracy. The assumption made is that not only 
the ratio aplan but also its Jogarithm is large. A numerical factor in the argument of the 
logarithm remains undetermined. l 

The excited states of the discrete spectrum are obtained as solutions of Schroédinger’s 
equation (3) with the field O(2) —e*/z (obtained from (7) with z~ ag> p). But this equation 
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can be brought, by the substitution y = z¢4(z), to the form 


e7d 


d 
~~ ( a — (9) 


which is the same as the equation for the radial wave functions of s states in the three- 
dimensional Coulomb problem. The required levels are therefore given by (36.10): 


E = — met DRN? (10) 


with n = 1, 2, 3, .... This expression too has only logarithmic accuracy; the next correction 
term would be small in comparison with the principal term, but only in the ratio 1/log 
(aglan). 

Equation (9) gives the wave function only for z>0. It can be continued into the region 
z<0 as x(—2z) = x(z) or as x(—z) = —x(z). Accordingly, in this approximation the levels 
(10) are doubly degenerate. This degeneracy is, however, removed in higher approximations 
with respect to au/ag. 


§113. An atom in a magnetic field 
Let us consider an atom in a uniform magnetic field H. Its Hamiltonian 
is 
Ee ar 2. | Ba+ Sacre) us lly. 8, (113.1) 
2m > c mc 


m 


where the summation is taken over all the electrons (the electron charge being 
written as —|e|); U is the energy of interaction of the electrons with the 
nucleus and with one another, and § = Eŝ4 is the operator of the total 
(electron) spin of the atom. 

If the vector potential of the field is taken in the form (111.7), then, as 
already noted, the operator p commutes with A. Expanding the bracket in 
(113.1) and denoting by Hp the Hamiltonian of the atom in the absence of 
the field, we find 

_ r lel a e? alely & 
A For SD. Aa Bat 3 2 Ag+ TH. $. 


c? m 


Substituting A from (111.7), we obtain 


i e? 


3 h 
A = Hot LH. D raxpa+ Je] 


> (H xra)?+ “4H. $. 


mC 





8mc? 
The vector ra x Pa, however, is the operator of the orbital angular momentum 
of the electron, and the summation over all the electrons gives the operator 
AL of the total orbital angular momentum of the atom. Thus 

H = Ro+peBĖ +28) .H + (e2/8me?) X (H xr,)?, (113.2) 


where pg is the Bohr magneton. The operator 


A, = ~—un(L +28) (113.3) 
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may be regarded as the operator of the “intrinsic” magnetic moment of the 
atom, which it possesses in the absence of the field. 

The external magnetic field splits the atomic levels and removes the 
degeneracy with respect to the directions of the total angular momentum (the 
Zeeman effect). Let us determine the amount of this splitting for atomic 
levels having definite values of the quantum numbers J, L and $ (i.e. 
assuming the case of LS coupling for the levels; see §72). 

We shall assume that the magnetic field is so weak that ugH is small com- 
pared with the distances between the energy levels of the atom, including the 
fine-structure intervals. Then the second and third terms in (113.2) can be 
regarded as a perturbation, the unperturbed levels being the separate com- 
ponents of the multiplets. In the first approximation we can neglect the third 
term, which is quadratic with respect to the field, in comparison with the 
second term, which is linear. 

In this approximation, the energy AE of the splitting is determined by the 
mean values of the perturbation in the (unperturbed) states which have 
different values of the projection of the total angular momentum on the 
direction of the field. Taking this direction as the z-axis, we have 


AE = ppH(L,+282) = peH(J2+ S2). (113.4) 


The mean value Jz is just the given eigenvalue of J = My. The mean value 
S, can be found as follows, using stepwise averaging (cf. §72). 

We first average the operator S over a state of the atom with fixed values of 
S, Land J, but not of Mg. The operator § thus averaged must be ‘‘parallel”’ 
to J, the only conserved “vector” characterizing a free atom. We can there- 
fore write a 


S = constant xJ. 


In this form, however, the equation is purely conventional, since the three 
components of the vector J cannot simultaneously have definite values. Its 
z-component can be taken literally: 


S, = constant x J, = constant x My, 
as can the equation 
S.J = constant xJ? = constant x J(J + 1), 
which is obtained on multiplying both sides by J. Taking the conserved 
vector J under the averaging sign gives §.J = S.J. The mean value §.J 
is the same as the eigenvalue 
S.J = 4{JJ+1)-L(L+1)+S(S+1)), 


to which it is equal in a state having definite values of L2, S2 and J?; cf. 
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formula (31.4). Determining the constant from the second equation and 
substituting in the first equation, we therefore have 


Sz = MJ. S/J?. (113.5) 


Collecting the above expressions and substituting in (113.4), we find the 
following final expression for the amount of the splitting: 


AE = pegMy/H, (113.6) 
where 


JJ +1)-L(£4+1)+ S($ +1) (113.7) 
2J +1) 


is called the Landé factor or the gyromagnetic factor. If there is no spin 
(S= 0, mad E — od = S)¢ = a 

Formulae (113.6) gives different values of the energy for all the 2J +1 
values My; = —J, —J+1, ..., J. Thus the magnetic field completely 
removes the degeneracy of the levels with respect to directions of the angular 
momentum, unlike the electric field, which leaves the levels with My = 
+|My| unsplit (§76).[ However, the linear splitting described by (113.6) 
does not occur if g = 0; this can refer even to states for which J # 0, such as 
4D41/2. 

We have seen in §76 that there is a relation between the displacement of an 
energy level of an atom in an electric field and its mean electric dipole 
moment. A similar relation exists in the magnetic case. The potential energy 
of a system of charges is given, in classical theory, by — a . H, where p is the 
magnetic moment of the system. In the quantum theory, it is replaced by the 
corresponding. operator, so that the Hamiltonian of the system is 


yea 


H = HAo-fi.H = Ao- ÊH. 


Now, applying (11.16), with the field H as the parameter A, we find that the 
mean value of the magnetic moment is 


fiz = —OAE/OH, (113.8) 


where AE is the displacement of the energy level for the given state of the 
atom. Substituting (113.6), we see that an atom in a state with a definite 


t The splitting is described by the general formulae (113.6) and (113.7) is often called the 
anomalous Zeeman effect. This unfortunate name arose because, before the spin of the 
electron was discovered, the effect described by (113.6) with g = 1 was regarded as normal 

t} The arguments applied to the electric-field case in §76 are not valid for a magnetic field. 
The reason is that H is an axial vector and therefore changes sign on reflection in any lane 
containing it. Hence the states obtained from each other by this operation belong t a 
different fields, not in the same field. ot aia 


466 Motion in a Magnetic Field §113 


value My of the projection of the angular momentum on some direction 2 
has a mean magnetic moment in that direction 


pes eae NG. (113.9) 


If the atom has neither spin nor orbital angular momentum (S = L = 0), 
the second term in (113.2) gives no displacement of the level, either in the 
first approximation or in any higher one (since the matrix elements of L 
and S vanish). Hence, in this case, the whole effect arises from the third 
term in (113.2), and in the first approximation of perturbation theory the 
displacement of the level is equal to the mean value 





beta E. 
he 2% _ > ieee). (113.10) 


Smc2 


Putting (H x ra)? = H?ra?sin?0, where @ is the angle between ra and H, 
and averaging with respect to the directions of r,, we have sin?@ = 1—cos?@ 
= 2/3 (the wave function of a state with L = S = 0 is spherically sym- 
metrical, and so the averaging over directions is independent of that over 
distances7,). Thus 





, = 
RP a a 113.11 
12mc? > l ) 


The magnetic moment calculated from (113.8) is then proportional to the 
magnitude of the field (an atom with L = S = O has, of course, no magnetic 
moment in the absence of a field). Writing it in the form yH, we can regard 
the coefficient y as the magnetic susceptibility of the atom, given by Langevin’s 


formula (P. Langevin 1905): 








2 
TE E- > re (113.12) 


It is negative, i.e. the atom is diamagnetic.T 

If ] = 0, but S = L # U, the displacement linear with respect to the field 
again vanishes, but the quadratic effect from the perturbation —Hat - H 
in the second approximation exceeds the effect (113.11).[ This is because, 
according to the general formula (38.10), the correction to the eigenvalue 
of the energy in the second approximation is given by a sum of expressions 
whose denominators contain the differences between the unperturbed 


+ The Thomas—Fermi model cannot be used to calculate the mean square distance of the 
electrons from the nucleus. Although the integral | nr? dr with the Thomas—Fermi density 
n(r) converges, it does so too slowly, and therefore the results are quite different from those 


given by experiment. 7 
t For S = L -Æ O, the non-diagonal matrix elements of Lz, Sz for the transitions S, L, J >S, 


L, J+1 are not in general zero. 
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energy levels, in this case the fine-structure intervals of the level, which 
are small quantities. We have remarked in §38 that the correction to the 
normal level in the second approximation is always negative. Hence the 
magnetic moment in the normal state is positive, i.e. an atom in the normal 
state with J = 0, L = S ¥ 0 is paramagnetic. 

In strong magnetic fields, where uo H is comparable with or greater than 
the intervals in the fine structure, the splitting of the levels differs from 
that predicted by formulae (113.6), (113.7); this phenomenon is called the 
Paschen—Back effect. 

The calculation of the energy of the splitting 1s very simple in the case 
where the Zeeman splitting is large in comparison with the intervals in the 
fine structure but still, of course, small compared with the distances between 
the different multiplets (when it may be shown that we can, as before, neglect 
the third term of the Hamiltonian (113.2) in comparison with the second). 
In other words, the energy in the magnetic field considerably exceeds the 
spin-orbit interaction.t Hence we can neglect this interaction in the first 
approximation. The projections M z and M s of the orbital angular momen- 
tum and spin are then conserved, as well as the projection of the total angular 
momentum, so that the splitting is given by the formula 


AE = ppH(M,+2Ms). (113.13) 


The multiplet splitting is superposed on the splitting in the magnetic 
field. It is determined by the mean value of the operator AL . § (72.4) with 
respect to the state with the given Mz, Mg (we are considering the multi- 
plet splitting due to the spin-orbit interaction). For a given value of one of 
the angular momentum components, the mean values of the other two are 


zero. Hence L.S= M, Mg, so that the energy of the levels is given in 
the next approximation by the formula 


AE = upH(M,_,+2Ms)+AM_Ms. (113.14) 


The calculation of the Zeeman effect in the general case of any type of 
coupling (not LS) is not possible. We can say only that the splitting (in a 
weak field) is linear with respect to the field and proportional to the projection 
My of the total angular momentum, i.e. it has the form 


AE = uB8EnJHM o3, (113.15) 


where the gny are some coefficients characterizing the term in question; 7 
denotes the assembly of all the quantum numbers, except J, which charac- 
terize the term. Though these coefficients cannot be calculated separately, it 
is possible to obtain a formula, useful in applications, which gives their sum 


taken over all possible states of the atom with the given electron configuration 
and total angular momentum. 


t For intermediate cases, where the effect of the magnetic field is comparable with the spin- 
orbit interaction, the splitting cannot be calculated in a general form; the calculation for S = 4 
is given in Problem 1. 3 
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By definition, 


gniMy = <nJMy\Lz+2S2|nJM5). 


The quantities g sry Mj, wheregsz,is the Landé factor (113.7) corresponding 
to LS coupling, are the diagonal matrix elements 


gstyMy = (SLIM ys|Lz ate 2ZSASLI MIY, 


calculated with a different complete set of wave functions. The functions in 
each set are obtained from the other set by a linear unitary transformation. 
But this transformation leaves unchanged the sum of the diagonal matrix 
elements (§12). Hence we conclude that l 


Z gngMy = 2 gsLIM3, 


or, since gny and gszy do not depend on M, 


È gny = ASII: (113.16) 
SL 


The summation is taken over all states with the given value of J which are 
possible for the given electron configuration. This is the required relation. 


PROBLEMS 
PROBLEM 1. Determine the splitting of a term with S = 4 by the Paschen-Back effect. 


SoLuTION. The magnetic field and the spin-orbit interaction have to be taken into 
account simultaneously by perturbation theory, i.e. the perturbation operator ist 


F = AL -Stpuale+2S:)H. 


As the initial wave functions for the zero-order approximation, we take functions correspond- 
ing to states with definite values of L, S = 3, Mz, Ms (L given; Mt = —L,...,L;Ms = +4). 
In the perturbed states, only the sum M= My = Mr +M s 1s conserved (V commutes with Ja. 
so that we can ascribe definite values of M to the components of the split term. 

The values M = +(L+4) can occur in only one way each: with |MiMs> = |L4> and 
{|—L, —4>. Hence the corrections to the energy of the states with these M are simply equal 
to the diagonal matrix elements <MiMs|V|MziMs> with the indicated values of |MLMs>. 
The remaining values of M can occur in two ways each: |4f—4, $> and |M +}, —4. Here 
two different values of the energy correspond to each M; they are determined from the secular 
equation formed from the matrix elements for transitions between these two States. The 
matrix elements of L.S are calculated by directly multiplying the matrices <M1|L|M'1> 
and </Ms|S|M’s>, and are 


(My 3ATsi|L . S AMM sy = Mily, 


(M+4, -IL SIAT- 3. > = Mh AL. SIM 43, — 
= W/CL +A 44><L -M 4+ 34)). 


+ We do not include in P the term proportional to (L . Š}? (the spin-spin interaction). It 
must be borne in mind, however, that, for a spin S = 4, the expression (L . S)? reduces by 
virtue of the properties of the Pauli matrices {see §55) to L . 5, and is therefore included in 
the formula as written here. 
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In the absence of a magnetic field, the term is a doublet, the distance between the com- 
ponents being e = A(L +4); see (72.6). We take the lower of these levels as the origin of 
energy. Then the final formulae for the levels in the magnetic field are 


E = etpph(L +1) for W = HLHI), 
E= = be+pBH M t y [\(¢ + pe H?) + un Me (2L +1), 
M=L-},..., -L+}. 


For paH/e<1 we have 
Er = €+ppHM 2(L+ {2L + l), E- = eH M . ZINC +1), 


in accordance with formulae (113.6), (113.7) (in which we must put S = 4}, J = L+}). 
For pal/e>1 we have 


We 





E+ = ppH(W+3)+he+ 


in accordance with (113.14). 


PROBLEM 2. Determine the Zeeman splitting for the terms of a diatomic molecule in 
case a. 

SOLUTION. The magnetic moment arising from the motion of the nuclei is very small in 
comparison with the magnetic moment of the electrons. Hence the perturbation due to the 
magnetic field can be written for the molecule as for a system of electrons, i.e. m the form 
used previously: V = prH .(L+2S), where L, S are the electron orbital and spin angular 
momenta. 

Averaging the perturbation with respect to the electron state, we have in case a 


paHn2(.\ + 2%) = prin«{2Q = A). 


The mean value of nz with respect to the rotation of the molecule is the c'agonal matrix 
element 


JMi JM = OMIT +0, 


where M = My, the matrix element is calculated from the reduced matrix element given by 
(87.4) with J and Q in place of K and A. Thus the required splitting is 


O(2Q — A) 


SEP tt a 
KI +3) 


PROBLEM 3. The same as Problem 2, but for case b. 


SOLUTION. The diagonal matrix elements <AKJ|V|AKJ> which determine the required 
splitting could be calculated from the general rules given in §87. However, it is simpler and 
more comprehensible to perform the calculation as follows. Averaging the perturbation 
operator with respect to the orbital and electron states, we obtain 


eBH (Anz + 2S.) 


(the spin operator is unaffected by this averaging). Next, we average with respect to rotation 
of the molecule; the mean value of nz is given by formula (87.4), and so we have 


pBH[AHK(K + 1)}R: +28). 


Lastly, we average with respect to the spin wave function; after the whole averaging, the mean 
values of the vectors must be directed parallel to the total angular momentum J, which is the 
only conserved vector. Hence we have (cf. (113.5)) 


BH A2 


J+ KK) J+28.J|u 
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(iM = M3), or finally 


A? 7 
aan I)+K(K+1)~ (8+ 1) + 





+[KJ=1)-A(K=1)+S(S+ DJH. 


PROBLEM 4. A diamagnetic atom is 1n an external magnetic field. Determine the strength 
of the induced magnetic field at the centre of the atom. 


SoLuTION. For S = L = 0 the Hamiltonian contains no perturbation linear in the field, 
and so the wave function of the atom involves no correction of the first order with respect 
to the magnetic field. The change j’ in the electron current in the atom induced by the 
external magnetic field is due (again in the first approximation with respect to H) only to the 
addition of the term (|e|/mmc)A to the electron velocity operator. We therefore havet 


j = —ple= mcjA = — ple? 2mc)H xr, (1) 


where p is the electron density in the atom. The magnetic field produced at the centre of the 
atom by this additional current is 


Het [Ear 


d 
in és 3 


cf. (121.8). Substituting (1) and averaging in the integrand over the directions of r, we obtain 


e? 
Hina = oe [fav 


mc? f 


= ——¥.(0)H , (2) 
3mc? 


where ¢,(0) is the potential of the field at the centre of the atom due to its electron envelope. 
In the Thomas—Fermi model ¢e(0) = —1-80Z4/3me3/ħ? (see (70.8)), so that 


Hina = —0-60(e?/hc)?Z4/3 A 
= —3-2x10-5243H, 


§114. Spin in a variable magnetic field 


Let us consider an electrically neutral particle having a magnetic moment, 
and situated in a magnetic field which is uniform but varies with time. 
We may have in mind either an elementary particle (a neutron) or a complex 
one (an atom). The magnetic field is supposed so weak that the magnetic 
energy of the particle in the field is small compared with the intervals between 
its energy levels. Then we can consider the motion of the particle as a whole, 
its internal state being given. i 

Let & be the operator of the “intrinsic” angular momentum of the particle— 
the spin of an elementary particle, or the total angular momentum J for an 
atom. The magnetic moment operator can be represented in the form 
(111.1). The Hamiltonian for the motion of a neutral particle as a whole can 
be written 


A = —(y/s)§.H; (114.1) 


+ This expression corresponds to the Larmor precession of the electron envelope of the 
atom round the direction of the external magnetic field; see Frelds, §45. 
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here we write out only the part of the Hamiltonian that depends on the spin. 

In a uniform field, this operator does not contain the coordinates ex- 
plicitlyt. Hence the wave function of the particle falls into a product of a 
coordinate and a spin function. Of these, the former is simply the wave 
function of free motion; in what follows, we shall be interested only in the 
spin part. We shall show that the problem of a particle with any angular 
momentum s can be reduced to the simpler problem of the motion of a 
particle of spin $(E. Majorana). To do this, it is sufficient to use the method 
which we have already employed in §57. That is, instead of one particle of 
spin s, we can formally introduce a system of 2s “particles” of spin 4. The 
operator $ is then represented as a sum È 8, of the spin operators of these 
“particles”, and the wave function as a product of 2sspinors of rank one. The 
Hamiltonian (114.1) then falls into the sum of 2s independent Hamiltonians: 


H = 2 Aa, Ha = —(z/s)H . Ŝa, (114.2) 


so that the motion of each of the 2s “particles” is determined independently of 
the others. When this has been done, we need only reintroduce the com- 
ponents of an arbitrary symmetrical spinor of rank 2s in place of the products 
of components of 2s spinors of rank one. 


PROBLEMS 


PROBLEM 1. Determine the spin wave function for a neutral particle of spin $, in a uni- 
form magnetic field which is constant in direction but varies in absolute magnitude 
according to an arbitrary law H = H(t). 


SOLUTION. The wave function is a spinor y» satisfying the wave equation 
they ct = -2H . 8yr, (1) 


Taking the direction of the field as the Z-axis, we can write this equation in spinor com- 
ponents: 


ih ep lt = pH, 1h pelet = pA 
Hence 
YA = celiuih) | Hat, $2 = ce (iuinyf Hat, 


The constants ¢,, cz must be determined from the initial conditions and from the normaliza- 
tion condition | 24/7 +] 471? = 1 


PROBLEM 2. The same as Problem 1, but for a uniform magnetic field constant in absolute 


magnitude, whose direction rotates uniformly, with angular velocity w, around the z-axis and 
atan angle 6 to it. 


SOLUTION. The magnetic field has the components 


H. = Hsin f cos at, Hy = Hsin @sin ut, H. = H cos 8, 





+ These arguments can also be applied to the case where any particle (chargėd or not) 
moves in a non-untiorm magnetic field, if its motion can be regarded as quasi-classical. The 
magnetic field, which varies as the particle moves along its path, can then be regarded simply 


as a function of time, and we can apply the same equations to the variation of the spin wave 
function. 
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and from (1) we obtain the equations 
yi! = twy(ys! cos 6+ petot sin 6), 


J? = rwy(wWiletut sin 0—4? cos 8), 
where wp = Hjh. The substitution 4! = e~tH#olg) p2? = eltwtg? converts these equa- 
tions into linear equations with constant coefficients, whose solution gives 
ppl = ettepet 1/24 ege-t), 


E 
Y2 = Zwyetutl? sin 6 ls Am2 oo" | 
Q+w+22wy cosl Q—w—2Zwy cos 


where 
Q = V[(w+2wy cos 6)? +4wp" sin?6]. 


§115. The current density in a magnetic field 


We shall now derive the quantum-mechanical expression for the current 
density when a charged particle moves in a magnetic field. 
We start from the formulat 


8H = —(1/c) [i-dadV; (115.1) 


this determines the change in the Hamilton’s function of charges distributed 
in space when the vector potential is varied.{ In quantum mechanics this 
formula must be applied to the mean value of the Hamiltonian of the charged 
particle: 


A= | Y*[(p—eA/c)?/2m—(u/s)H . SV dV. (115.2) 


7 
Effecting the variation and bearing in mind that 6H = curl ôA, we find 


2 
LE Í ve] —- (p .5A+8A.P)+Ž ASA |Y avul f curl ôA. Y* SF dV. 
2mc mc? 


(115.3) 
The term in f. ôA is transformed by integration by parts: 


| E*B.3A €F aV = — ih| FVA Y) dV 
= ih | SA. VY" aV 


+ In this section, j denotes the electric current density, i.e. the particle flux density multi- 
plied by the particle charge e. i 

t Lagrange’s function for a charge in a magnetic field contains a term eV - A/c, or, if the 
charge is distributed in space, (1/c) f j. A dV. 

The change in the Lagrange’s function when A is varied 1s therefore 


Mi (lje) [ j-8A dV. 


An infinitely small change 1n Hamilton’s function is, however, equal to the change in 
Lagrange’s function, taken with the opposite sign (see Mechanics, §40). 
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(the integral over an infinitely distant surface vanishing in the usual way). 
The integration by parts is also used in the last term in (115.3), together 
with the well-known formula of vector analysis 


a.curlb = —div(axb)+b. curl a, 


The integral of the div term vanishes, so that we have 


l Wey. curlSAdV = Í SA. curl(¥*8Y) dV. 


The final result is 


ieħ e? 
sË = —— fea (FVT *—-Y* VY) av+—| A .dAYY* dVv— 
2mc mce? 


—(p/s) f SA. curl(Y*SF) dV. 


Comparing this expression with (115.1), we find the following expression 
for the current density: 
2 
j= lehigeeyy —¥*V¥]—- SAPO Pp + (u/s)c curl(Y’*s¥). (115.4) 
2m mec 
We emphasize that, though this expression contains the vector potential 
explicitly, it is nevertheless one-valued, as it should be. This is easily seen 
by direct calculation, recalling that the transformation (111.8) of the vector 
potential must be accompanied by the transformation (111.9) of the wave 
function. 


It is also easy to verify that the current (115.4) and the charge density 
p = e|'F|? satisfy, as they should, the continuity equation 


Op/et+divy = 0. 


The last term in (115.4) gives the contribution of the magnetic moment of 
the particle to the current density. It is c curl m, where 


m = (p/s)P*8Y = ProF (115.5) 


is the spatial density of the magnetic moment. 

The expression (115.4) is the mean value of the current. It may be 
regarded as a diagonal matrix element of the current density operator j. 
This operator is most simply written in the second quantization form, with 
‘F and ¥* replaced by operators Y and Y+ (and, according to the general 
rule, with '¥+ on the left of Y in each term). The non-diagonal matrix 
elements of this operator can be determined also: 


a teh i “ > 3 2 
a Oe EA n*) Em — Fn*V Fm] -L AF, Fn t 
771 me 


+(u/s)e curl(Y, *8¥,,). (115.6) 


CHAPTER XVI 


NUCLEAR STRUCTURE 


§116. Isotopic invariance 


THERE is as yet no complete theory of nuclear forces—that is, the forces which 
act between nuclear particles or nucleons and hold them together in the nucleus 
of an atom. In consequence, to describe nuclear forces it is still necessary to 
rely ori experiment to a much greater extent than would be needed if a con- 
sistent theory were available. 


The two tvpes of particle which are nucleons differ mainly in their electrical 
properties, the proton (p) having a positive charge, while the neutron (n) is 
neutral. They have the same spin }, and their masses are almost equal (1836-1 
and 1838-6 electron masses respectively). This similarity is no accident. 
Despite the difference in electrical properties, the proton and the neutron are 
very similar particles, and this similarity is of fundamental importance. 


It is found that, apart from the relatively weak electric forces, the forces 
of interaction between two protons are very similar to those between two 
neutrons. This is called the charge symmetry of nuclear forces.T 


In so far as this symmetry is maintained we can, in particular, say that 
systems of two protons (pp) and two neutrons (nn) have states whose properties 
are the same. Here, of course, it is important that protons and neutrons obey 
the same statistics (namely Fermi statistics) and so only states with the same 
symmetry of the wave functions ¥(r1, 01; r2, o2) are permissible for the pp 
and mn systems, namely those antisymmetrical with respect to a simultaneous 
interchange of the coordinates and spins of the particles. 


Charge symmetry is, however, only one of the manifestations of a still more 
far-reaching physical similarity between protons and neutrons, known as 
isotopic invariance.{ This leads to the existence of an analogy not only between 
pp and mn systems (obtained from one other by interchanging all protons 
and neutrons), but also between these and the pz system, which consists of 
different particles. There cannot be a complete analogy here, of course, 
since the possible states of the pn system, in which the particles are non- 
identical, are certainly not restricted to those with antisymmetrical wave 
functions. It is found, however, that among the possible states of the pn 
system there are some whose properties are almost exactly the same as 


+ It appears, in particular, in the similarity of the properties (binding energy, energy 
spectrum, etc.) of what are called mirror nuclei, i.e. those which differ in that the numbers of 
protons and neutrons are interchanged. 

{t Also called isobaric invariance in the literature. 
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those of systems of two identical nucleonst; these states are, of course, 
described by antisymmetrical wave functions (the remaining states of the 
pn system are described by symmetrical wave functions and do not occur in 
the pp and nn systems). 

Isotopic invariance, like charge symmetry, is valid only if electromagnetic 
interactions are neglected. Another reason why isotopic invariance is only 
approximately true is the slight mass difference between the neutron and the 
proton; if there were exact symmetry between neutrons and protons, their 
masses would of course be identical also. 

A convenient formalism may be used to describe the isotopic invariance. 
It follows naturally from the fact that isotopic invariance is equivalent to 
the possibility of classifying the states of a system of nucleons with respect 
to the symmetry of its coordinate-spin wave functions y, independent of the 
types of nucleons concerned. The required formalism must therefore enable 
us to define for the description of the states of the system a new quantum 
number which uniquely determines the symmetry of the functions y. A 
similar situation has already been encountered in connection with the pro- 
perties of a system of particles with spin 4. We have seen in §63 that, if the 
total spin S of such a system ts specified, then the symmetry of its coordinate 
wave function ¢ is uniquely determined, regardless of which of the two possible 
values ( + $) is taken by the component c of the spin of each particle. 

It is therefore reasonable that, for a formal description of isotopic in- 
variance, the neutron and the proton should be regarded as two different 
“charge states”? of one particle, the nucleon, differing in the value of the 
component of a new vector t, whose formal properties are analogous to those 
of the vector of spin 4. This new quantity, which is usually called the 
isotopic spin or isospin,|| is a vector in “isotopic space” &, n, & (which, of 
course, 1s not related in any way to real space). 

The component of the isotopic spin of a nucleon along the f-axis can take 
only the two values ry = +4. The value +4 is arbitrarily assigned to the 
proton and — tothe neutron.tt+ The isotopic spins of several nucleons add 
to give the total isotopic spin of the system in accordance with the same 
rules as for the addition of ordinary spins. The f-component of the total 
isotopic spin of the system is equal to the sum of the values of zz for the 
component particles. For a nucleus in which the number of protons (i.e. the 
atomic number) is Z, the number of neutrons N and the mass number 
A = Z+N we have 


Ti = Lr 


HZ a Se, (116.1) 


i.e. Tg gives the total charge of the system if the number of nucleons is fixed. 





t This was shown from an analysis of experimental data on the scattering of neutrons and 
protons by protons (G. Breit, E. U. Condon and R. D. Present 1936). 

t In reality this mass difference between the neutron and the proton is probably also electro- 
magnetic in Origin. 

, First used by W. Heisenberg (1932), and applied to the descrıption of isotopic invariance 
by B. Cassen and E. U. Condon (1936). 


tt The opposite assignment is also found in the literature. 
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It is therefore clear that there is a strict conservation of the quantity Tr, 
which simply expresses the conservation of charge. 

The absolute magnitude T of the isotopic spin of the system determines 
the symmetry of the “charge part” w of the wave function of the system, just 
as the total spin S determines the symmetry of the spin wave function. It 
therefore determines also the symmetry of the coordinate~spin (i.e. the 
ordinary) wave function y, since the total wave function of a system of 
nucleons (i.e. the product pw) must have a definite symmetry; as for all 
fermions, it must be antisymmetrical with respect to simultaneous inter- 
change of the coordinates, spins and ‘‘charge variables” 7, of the particles. 
The existence of a definite symmetry of the wave functions y of any system of 
nucleons is therefore expressed, in this treatment, by the conservation of the 
quantity 7. 

We can say, in other words, that isotopic invariance signifies the invariance 
of the properties of the system with respect to rotatidns in isotopic space. 
States differing only in the value of T; (with T and the remaining quantum 
numbers having given values) have identical properties. In particular, charge 
symmetry—the invariance of the properties of the system with respect to the 
replacement of neutrons by protons and vice versa, being a particular case 
of isotopic invariance, is described as invariance with respect to a simultan- 
eous change of sign of all the 7z, i.e. with respect to rotation in isotopic 
space through an angle of 180° about an axis lying in the &y-plane. 

It may be noted that the obvious violation of isotopic invariance by the 
Coulomb interaction ,1s also formally evident from this treatment. The 
Coulomb interaction depends on the charge, i.e. on the -components of 
the isotopic spin, which are not invariant with respect to rotauons in £n€- 
space. á 
. Let us consider, for example, a system of two nucleons. Its total isotopic 
spin can take the values T = 1 and T = 0. For T = 1, the possible values 
of the component T, are 1, 0, — 1. According to (116.1), the corresponding 
charge values are 2, 1, 0, i.e. a system with T = 1 may be pp, pn or nn. The 
charge part w of the wave function with T = 1 is symmetrical (just as a 
symmetrical spin function corresponds to a spin S = 1; cf. §62). Hence 
states with antisymmetrical ordinary wave functions ys correspond to the 
value T = 1. For T = 0 we can only have T; = 0, and the corresponding 
function w is antisymmetrical; this therefore relates to states of the pm 
system with symmetrical wave functions y. 

The isotopic spin corresponds to an operator = which acts on the charge 
variable 7; in the wave function, just as the spin operator $ acts on the spin 
variable o. By virtue of the complete formal analogy between the two, 
the operators ?,, 7,, 7, are given by the same Pauli matrices (55.7) as the oper- 
ators fz, Sy, Sz- 

Here we may note some combinations of these operators which have a 
simple and evident meaning. The sum 


oll 
m=i (o i 
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is an Operator which, acting on a neutron wave function, converts it into a 
Proton wave function, and acting on a proton wave function gives zero. 


Similarly, the operator 
(e o) 
#.=7,-i7, = 
TE Ty 1 0 


converts a proton into a neutron and “annihilates” a neutron. Finally, the 


operator 
} l 0 
+7 = 
"RG 4 


leaves a proton wave function unchanged and annihilates a neutron; on 
multiplication by e, it may be called the nucleon charge operator. 

We shall also show that the operator Ê of the interchange of two particles 
may be expressed in terms of the operators 71, t2 of their isotopic spins. 
By definition, the result of the action of the interchange operator on the wave 
function (ri, 61; re, o2) of the system of two particles consists in interchang- 
ing their coordinates and spins, i.e. interchanging the variables ri, c} and 
ro, og. The eigenvalues of this Operator are +1, and occur when it acts on a 
symmetrical or antisymmetrical function 4: 


Piisym = faym; Pant = — Want. (116.2) 


We have seen above that the functions Ysym and yant correspond to charge 
functions wr with values of the total isotopic spin T = 0 and T = 1. Hence, 
in order to put the operator P in a form in which it acts on charge variables, 
it must have the properties 


Pp = ees a5 (116.3) 


These conditions are satisfied by the operator Late, as 1s easily seen by 
noting that wp is the eigenfunction of the operator T? corresponding to the 
eigenvalue 7(7+1). Finally, writing T = t,+ 2 and using the fact that Ti 
and t, have the same definite values t(r+1) = 3, we find the required ex- 
pressiont 


P =1-T? = -}—22,.45, (116.4) 


For the matrix elements of different physical quantities in a system of 
nucleons there are certain selection rules for the isotopic spin (L.A. Radicati 
1952). Let F be some quantity (of any tensor rank) having the property of 
additivity, in the sense that its value for the system is equal to the sum of its 


values for the individual nucleons. We write the operator of such a quantity 
as 


r= Efp+ Ufn, 


where the summations are over all protons and all neutrons in the system. 


re 





t An operator of this form derived trom the ordinary spins of the particles has already been 
met in §62, Problems, 
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This expression can be written in the tdentical form 


F = X(t+ tft X($— tfn 
=42(fotfn)t+ Elfo- fni (116.5) 


where the summation in each term is over all nucleons (both protons and 
neutrons). The first term in (116.5) is a scalar ; the second is the (-component 
of a vector in isotopic space. The same selection rules therefore apply to them, 
with respect to the isotopic spin, as to scalars and vectors in ordinary space 
with respect to the orbital angular momentum (see §29): the isotopic scalar 
allows only transitions without change of T; the -component of the isotopic 
vector has matrix elements only for transitions in which AT = 0 or +1, 
and in addition transitions with AT = 0 are forbidden between states with 
Tz = 0, i.e. systems with the same number of neutrons and protons; the 
latter rule follows from the fact that the matrix element of a transition with 
AT = Ois proportional to Tẹ (see (29.7)). 

For example, for the dipole moment of the nucleus the quaniities fp are 
the products er, and fn = 0. The first term in (116.5) ts then 


deXr = (e/2m) Lmr, 


is therefore proportional to the radius vector of the centre of mass, and can 
be made to vanish by a suitable choice of the origin. Thus the dipole moment 
of the nucleus reduces to the -component of the isotopic vector. 


§117, Nuclear forces 


The principal characteristic of the specifically nuclear forces which act 
between nucleons is their short range of action: they decrease exponentially 
at distances of the order of 10-13 cm. 

In the non-relativistic limit we can say that nuclear forces are independent 
of the velocities of the nucleons and have a potential; the velocities of the 
nucleons in the nucleus are about one-quarter of the velocity of light (see 
below). The potential energy U of the interaction of two nucleons depends 
not only on the distance r between them but also quite strongly on their spins.T 
The precise dependence on r could, of course, be established only by a 
consistent theory of nuclear forces. The nature of the spin dependence, 
however, can be found from simple considerations based on the properties of 
spin operators. 

We have at our disposal only three vectors on which the interaction energy 
U can depend: the unit vector n in the direction of the radius vector between 
the two nucleons, and their spins sı and sz. According to the general proper- 
ties of an operator of spin 4, any function of it reduces to a linear function 
(§55). It must also be taken into account that the product n.s is not a true 
scalar but a pseudoscalar (since n is a polar vector and s an axial vector). 





+ In this respect the interaction of nucleons differs considerably from the interaction of 
electrons, for which the spin-spin interaction is purely relativistic and is small (in atoms). 
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Thus it is evident that only two independent scalar quantities linear in each 
of the spins can be constructed from the three vectors n, sı, $2, namely 
S1.S2 and (n.s1) (n.s2).t 

Consequently, the operator of the interaction of two nucleons, as regards 
its dependence on the spins, can be written as the sum of three independent 
terms: 


U ord = U(r) + Ue(r)(§1-82) + U3(r)[3(81.m)(62.n) = 81.89], ( 1 17. 1) 


of which two depend on the spins and one does not. The third term is here 
written in a form which gives zero on averaging over the directions of n. 
The forces described by this term are usually called tensor forces. 

In (117.1) we have used the suffix ord (for “‘ordinary’’) in order to emphasize 
the fact that this operator does not affect the charge state of the nucleons. 
There is another possible interaction which converts a proton into a neutron 
and vice versa. The operator of this “exchange” interaction differs in form 
from (117.1) by the presence of the particle interchange operator (116.4): 


U awe = { U,(r) + Us(r)(S1.82) + U¢(r)[3(§1.n)(S2.n) — §1.82]}P. (1 17.2) 
The total interaction operator is the sum 


U = ised + Ose. (117.3) 


Thus the interaction of two nucleons is described by six different functions 
of the distance between them. All these terms are in general of the same 
orcer of magnitude. f 

The spin operators appearing in (117.1) and (117.2) can be expressed in 
terms of the total-spin operator 8. By squaring the equations§ = 8,48» 
and S.n = §).n+8s.n and using the results gr = 8? = 4, (81.n)? = (S2.n)? 


= + (see (55.10)), we find 
$1.82 = #(S’—3), (81-n)(82-n) = 4[(S.n)?—-4). (117.4) 


The operator $2 commutes with the operator §, and so the interactions 
described by the first two terms in (117.1) and (117.2) conserve the total 
spin vector of the system. The tensor interaction contains the operator 
(S.n)2, which commutes with the square §2 but not with the vector § itself. 
In consequence, only the magnitude of the total spin is conserved, not its 
direction. í 

The total spin S of a system of two nucleons can take the values 0 and 1, 
as can the total isotopic spin T. Hence all possible states of this system fall 
into four groups with various pairs of values of S and T. For states in each 





t Here it is assumed that the nuclear forces are invariant under spatial inversion, i.e. that 
they cannot include pseudoscalar quantities. There are at present no experimental results to 
disprove this assumption. 

t It may also be mentioned that the interaction dependent on the velocities of the nucleons, 
in an approximation linear with respect to these velocities, is described by an operator of the 
form [¢i(r)+ ¢2(r)P]L .S, where L = r xp is the orbital angular momentum of the relative 
motion of the nucleons, p the linear momentum of this motion, and S = s, +S; this operator 
contains two functions of r. Terms of the form p.n and S. n are excluded by 


. i : the require- 
ments of invariance under inversion and under time reversal. 
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of these groups there is an interaction operator of the form 4(r) (for S = 0) 
or A(r) + B(r) [(S.n)2— 4] (for S = 1), to which the general operator (117.3) 
reduces in these cases (see Problem 1).t 

For given values of S and T the states of the system are classified with 
respect to the values of the total angular momentum J and the parity. As 
we know, the values T = 0 and T = 1 correspond to the states with sym- 
metrical and antisymmetrical wave functions Ņ respectively. Since, on the 
other hand, the value of S determines the symmetry of the wave function 
with respect to the spin variables (symmetrical for S = | and antisymmetrical 
for S = 0), it is clear that, if the two numbers S and T are specified, the 
symmetry of the wave function with respect to the space variables (i.e. the 
parity of the state) is also determined. Evidently the states of the system with 
isotopic spin T = 0 can only be even triplets (S = 1) or odd singlets (S = 0), 
while those with isotopic spin T = 1 are odd triplets or even singlets. 

Since the spin, as a vector, is not conserved, the orbital angular momentum 
also need not in general be conserved; only the sum J = L+S is conserved. 
Nevertheless, the magnitude L may be conserved simply because specified 
values of J, S and the parity (or J, S and T) may be compatible with only 
one particular value of L (the parity of a system of two particles, it will be 
remembered, is (—1)£). For example, an odd state with S = 1, J = 1 can 
only have L = 1, i.e. it is 3P}. In other cases two different values of L may 
correspond to given values of J, S and the parity, so that L is not conserved. 
For example, in an odd state with S = 1, J = 2 we can have L = 1 or 
L = 3, ie. it is a superposition 3P +3F2. 

Thus we arrive at the following possible states of a system of two nucleons 
(the signs + indicating the parity): 
for T = 1: 2Po-,°Pi-, GPa ke) eis 3 5 

1Sot,1Det,1Gaqt, ... 5 
for T = 0: (2S, +8D})*,3D9", (3=D3+°G3)*, ... ; 
1P,-,1F3-, ee 


Paa 


Nuclear forces are not in general additive. This means that the interaction 
in a system of more than two nucleons does not reduce to a sum of inter- 
actions between each pair of particles. It seems, however, that ternary and 
higher interactions are relatively unimportant in comparison with binary 
interactions, and so, in discussing the properties of complex nuclei, we can 
to a considerable extent take as basis the properties of binary interactions. 

Experimental results concerning nuclei show that, as the number A of 


+ The experimental results concerning the properties of the deuteron show that for T = 0, 
S = 1 the nucleon interaction involves a strong attraction with a deep ‘potential weli”? (the 
presence of tensor forces makes it difficult to formulate this fact in terms of properties of the 
functions A(r) and B(r)); in addition, it follows from the sign of the observed quadrupole 
moment of the deuteron that in this state the coefficient B(r) in the tensor forces is negative. 
From nucleon scattering results it follows that for T = 1, S = O there is also an attraction, 
but one which is weaker and, in particular, does not lead to the formation of a stable system 


of two particles. 
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particles increases, the system of nucleons begins to behave like a macro- 
scopic “nuclear matter”, whose volume and energy increase in proportion to 
A (apart from effects due to the Coulomb interaction of protons and the 
existence of a free surface of the nucleus). The property of nuclear forces 
which gives rise to this phenomenon is called saturation. 

The existence of this property imposes certain restrictions on the functions 
Uj, ...,Ug which determine the binary interactions of nucleons. Let us 
suppose that all the particles are concentrated in a volume whose dimensions 
are of the order of the radius of action of nuclear forces. Then every pair of 
particles interact. If there is a configuration of certain nucleons (and an 
orientation of their spins) for which attractive forces act between every 
pair, then the potential energy of such a system is negative and proportional 
to A?; the kinetic energy is positive and proportional to 45⁄3, a smaller 
power of A.t It is clear that under such conditions a sufficiently large 
number of nucleons will in fact be concentrated in a small volume independ- 
ent of A, i.e. will not form nuclear matter. The condition for saturation of 
nuclear forces must therefore be expressed as the conditions for the absence 
of configurations leading to a negative interaction energy proportional to 
A? (see Problem 2). 

The proportionality between the volume of nuclear matter and the number 
of particles is expressed by a relation of the form 


R= Ais, (117.5) 


which connects the radius R of the nucleus and the number A of particles 
in it. Experimental results (on the scattering of electrons by nuclei) lead to 
the value ro = 1-1 x 10-13 cm. 

We may determine the limiting momentum of nucleons in nuclear matter 
(cf. §70). The volume of phase space corresponding to particles in unit 
volume of physical space and with momenta p< po is 47p3/3. Dividing by 
(27h)°, we obtain the number of “cells” in each of which two protons and 
two neutrons can be simultaneously. Putting the number of protons equal 
to the number of neutrons, we obtain 4(47/3)(po/27h)8 = A/V, where V is 
the volume of the nucleus. Substitution of (117.5) gives 


po = (3n2A/2V)/3h = (9r)1/3h/2rp 
= 1-4 x 10-14 g. cm/sec. 


The corresponding energy p2/2m , where mp is the nucleon mass, is ~ 40 
MeV, and the velocity po/my =% 4c. 


PROBLEMS 


PROBLEM 1. Find the operators of the interaction of two nucleons in states with definite 
values of S and T. 


t The density n at which the particles are concentrated ine 
to their number 4, and the kinetic energy of each Particle is pr 
The total kinetic energy is therefore ~ 4.423. 


given volume is proportional 
oportional to n?/? (cf. (70.1)). 
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SOLUTION. The required operators Ű sr are obtained from the general expression (117.1) 
—(117.3), using (116.3) and (117.4): 


Uoo = Uy — 3U 2+ Ua— iUs, 

Uo. =U1—}U2— Ua + 3Us, 

Uyo =U, +2U2+ Ua+}4Us+ 
+3(U3+ Us)(3(S.n)2—2], 

Uy = Uy +}U2—Uy—1Us+ 
+$(U3— Us)(3($.n)?~ 2). 


PROBLEM 2. Find the conditions for the saturation of nuclear furces, assuming tensor 
forces absent. The radii of action of forces of all other types are supposed equal. 


SOLUTION. Let us consider some extreme cases (between which lie all other possible 
cases) for the state of a system of A nucleons, and write down the conditions for the inter- 
action energy of an “average” pair of nucleons in this system to be positive. 

Let the total spin and the isotopic spin of the nucleus have the greatest possible values: 
Soue = Toue = $A (when all the particles in the system are protons with their spins parallel). 
Then for each pair of nucleons we have S = T = 1, and the condition is 


Uy > 0. (1) 


Next, let Tauc = 44, Snuc = 0. Then for each pair of nucleons T = 1, and the mean value 
of sz for an individual nucleon is zero. The latter result means that the nucleon can have 
se = 4 and sz = —4 with equal probability; under these conditions the probabilities that a 
pair of nucleons are in states with S = 0 or 1 are respectively } and ł (being proportional to 
the number 2S +1 of possible values of Sz). The condition for the mean energy of the pair 
to be positive is therefore 


}ŁUo + Uj > 0. (2) 
Similarly, a discussion of the state with Tnuc = 0, Snuc = 4A gives the condition 
1Uj0+ 2U 1 > 0. (3) 


In a state with Tnuc = Snuc = 0, the probability for a pair of nucleons to have S = T = 1 
is į . 3, that for T = 1, S = 0 is Ẹ . 4, and so on. Hence we find the condition 


9 3 1 
—U y+ — Uy + Vox) + — Uo > 0. 4 
ig 12 A 10+ Uor) Tii (4) 


Finally, let the system consist of 44 protons and $A neutrons, with the spins of all the 
protons in one direction and the spins of all the neutrons in the other direction. An individual 
nucleon can with equal probability be p or n, i.e. have 7¢ = $ or qg = —4; the probability 
for a pair of nucleons to have T = 01st. Here one of the pair of nucleons is p and the other 
n, and hence Sz = 0. This value of Sz can occur with equal probability from states with 
S = Qand S = 1. Consequently the probabilities for the pair to be in the states with T = 0, 
S =0 and T=0, S =1 are each } .4 = 4. The probability of the state with T = 1, 
S = 0 is the same, and the remaining $ relates to the state with T = S = 1. Thus we have 
the condition 


4(Uoo+ Uoi + Uio) + gUn > 0. (5) 


The inequalities (1)—(5) form the required set of conditions for the saturation of nuclear 
forces. 


§118. The shell model 
Many properties of nuclei can be well described by means of the shell 
model, which is basically similar to the structure of the electron shells of an 
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atom. In this model each nucleon in the nucleus is regarded as moving in 
a self-consistent field due to all the other nucleons; owing to the small range 
of action of nuclear forces, this field decreases rapidly outside the volume 
bounded by the “‘surface”’ of the nucleus. Accordingly the state of the nucleus 
as a whole is described by specifying the states of the individual nucleons. 

The self-consistent field is spherically symmetrical, and the centre of 
symmetry is, of course, the centre of mass of the nucleus. The following 
difficulty arises here, however. In the self-consistent field method, the wave 
function of the system is constructed as the product (or the appropriately 
symmetrized sum of the products) of the wave functions of the individual 
particles. Such a function, however, does not keep the centre of mass fixed: 
although the mean velocity of the centre of mass calculated from this function 
is zero, it gives a finite probability of non-zero values of the velocity. 

The difficulty can be avoided by first eliminating the motion of the centre 
of mass in calculating any physical quantity by means of the wave functions 
(ri, ..., ra) of the self-consistent field method. Let f(ti,pi) be some physical 
quantity, a function of the coordinates and momenta of the nucleons. Then, 
in calculating its matrix elements by means of the functions y, we must, 
without changing ¢(r;), alter the arguments of the function f as follows: 


ři >r;—R, pi>pi—P/A, (118.1) 


where R is the radius vector of the centre of mass of the nucleus, A the 
number of particles in it, P the momentum of its motion as a whole; the 
second change in (118.1) corresponds to subtracting the velocity V of the 
centre of mass from the velocities v; of the nucleons, the momentum P 
being related to V by P = AmpV (S. Gartenhaus and C. Schwartz 1957). 

For example, the dipole moment operator of the nucleus is d = e2rp, 
where the summation is over all protons in the nucleus. To calculate the 
matrix elements in the self-consistent field method, this operator must be 
replaced by e£(rp— R). The coordinates of the centre of mass of the nucleus 
are 


1 
R = — (2 ZF), 
ore 


where the summation is over all protons and neutrons. Since the number of 
protons in the nucleus is Z, the dipole moment operator must finally be 
changed thus: 


Z Z 
ES Pa Een z 
eXtp >e(1 z )=tp = Ern. (118.2) 


The protons appear here with an “effective charge’ e(1—Z/A) and the 
neutrons with a “charge?” —eZ/A. It may be noted that the relative order of 
magnitude of the resulting correction terms in the calculation of the dipole 
moment 1s seen from (118.2) to be of the order of unity. The corrections in 








t For electrons in an atom this difficulty did not arise, because the centre of mass coincided 
in position with the fixed heavy nucleus, and was therefore necessarily at rest. 
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the calculation of the magnetic and higher electric multipole moments are 
easily found to be of relative order 1/A. 

In the non-relativistic approximation the interaction of a nucleon with the 
self-consistent field is independent of the spin of the nucleon: such a depend- 
ence can be given only by a term proportional to §.n, where n is a unit 
vector in the direction of the radius vector r of the nucleon, and this product 
is a pseudoscalar, not a true scalar. 

A dependence of the nucleon energy on the spin appears, however, when 
relativistic terms depending on the velocity of the particle are taken into 
account. The largest of these is the term linear in the velocity. From the 
three vectors s, n and v a true scalar n x v.s can be formed. The spin-orbit 
coupling operator of the nucleon in the nucleus is therefore 


Pa = —d(r)nx¥.8, (118.3) 


where ¢(r) is some function of 7; see also the third footnote to §117. Since 
mpr x v is the orbital angular momentum Äl of the particle, the expression 
(118.3) can also be written as 


Pa= —f(ris, (118.4) 


where f = fid/rmp. It should be emphasized that this interaction is of the 
first order in v/c, whereas the spin-orbit coupling of an electron in an atom 
is a second-order effect (§72). This difference is due to t’.e fact that nuclear 
forces depend on the spin even in the non-relativistic approximet:on, whereas 
the non-relativistic interaction of electrons (Coulomb forcés) is not spin- 
dependent. 

The energy of the spin-orbit interaction is mainly concentrated near the 
surface of the nucleus, i.e. the function f(r) decreases inside the nucleus. 
This is because, in infinite nuclear matter, there would be no such interaction 
at all, as is clear from the fact that, the system being homogeneous, there is 
no preferred direction in it which could be that of the vector n. 

The interaction (118.4) brings about a splitting of the nucleon level with 
orbital angular momentum / into two levels with angular momenta j = U4. 


Since 


l.s =} for j =1+4, 
= —}(1+1) for j = l—$ 


(118.5) 


(according to formula (31.3)), the amount of this splitting 1s 


AE = Ey-4y— Ern 
= f(r)(1+4). (118.6) 
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Experiment shows that the level with j = 1+ 4 (the vectors 1 and s parallel) 
is below the level with j = /—4; this means that f(r) > 0. 

The spin-orbit coupling of a nucleon in the nucleus is relatively weak in 
comparison with its interaction in the self-consistent field. It is, nevertheless, 
in general large compared with the energy of the direct interaction of two 
nucleons in the nucleus, on account of the more rapid decrease of the latter 
with increasing atomic weight. 

This relation between the energies of the various interactions has the 
result that the classification of the nuclear levels must be of the jj coupling 
type: the spins and orbital angular momenta of the various nucleons are 
added to give the total angular momenta j = 1+s, which are definite 
quantities, since the relation between | and s is not affected by the direct 
interaction between the particles (M. Gédppert-Mayer 1949; O. Haxel, 
J. H. D. Jensen and H. E. Suess 1949)¢. The vectors j of the individual 
nucleons are then added to give the total angular momentum J of the nucleus 
(usually called simply the nuclear spin, as if the nucleus were an elementary 
particle). In this respect the classification of nuclear levels differs essentially 
from that of atomic levels: in the electron shells of the atom, the relativistic 
spin—orbit coupling is in general small in comparison with the direct electric 
and exchange interactions, and so the level classification is usually based 
on LS coupling. 

The state of each nucleon in a nucleus, however, 1s described by its angular 
momentum j and its parity. Although the vectors | and s are not separately 
conserved, the absolute magnitude of the orbital angular momentum of the 
nucleon is nevertheless definite. For the angular momentum j can arise 
either from a state with Z = j—4 or from one with /=j+4. For a given 
(half-integral) j, these two states have different parities (—1)', and so, if 
j and the parity are specified, the quantum number / is determined also. 

The states of nucleons with given / and 7 are customarily numbered (in 
order of increasing energy) by the “principal quantum number” n, which 
takes integral values starting from 1.[ The various states are denoted by the 
symbols 1s} 1p;, Ip; etc., where the figure before the letter is the principal 
quantum number, the letters s, p, d, ... indicate as usual the value of Z, and the 
suffix is the value of j. Not more than 2j+ 1 neutrons and the same number 
of protons can simultaneously be in a state with given values of n, l and j. 

The states of the nucleus as a whole (in a given configuration) are 
customarily described by a figure giving the value of J and the sign + or — 
indicating the parity of the state (the latter being determined in the shell 
model by the parity of the algebraic sum of the values of / for all the nucleons). 

From an analysis of experimental results concerning the properties of 
nuclei it is possible to derive a number of regularities in the positions of the 
nuclear levels. First of all, it is found that the energy of the nucleon increases 
with the orbital angular momentum /. This rule arises because, when Z 


+ The coupling ts closer to LS only for the lightest nuclei. 
t Unlike the usual procedure for electron levels in an atom, where the number 7 takes values 
starting from /+1. 
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increases, so does the centrifugal energy of the particle, and its binding 
energy is therefore reduced. 

Next, for a given value of / the level with j = 7+4 (i.e. that which corres- 
ponds to parallel vectors 1 and s) lies below the level with J = l—4. This 
rule has already been mentioned in connection with the properties of the 
spin-orbit coupling of the nucleon in the nucleus. 

The following rule relates to the isotopic spin of nuclei. The component 
Tr of the isotopic spin is known to be determined by the atomic weight and 
atomic number of the nucleus (see 116.1)). For a given value of T, the 
absolute magnitude of the isotopic spin can take any value such that 
T >| T,|. Usually, the ground state of the nucleus has the smallest of these 
possible values of the isotopic spin, 1.e. 


Ter = | Tg] = N-Z). (118.7) 


This rule is due to a property of the neutron—proton interaction, namely that 
in the np system the state with isotopic spin T = 0 (the deuteron state) has a 
greater binding energy than the state with T = 1; see the fourth footnote to 
§117. 

We can also formulate certain rules relating to the spins of the ground 
states of nuclei. These rules determine the way in which the angular momenta 
j of the individual nucleons add to give the total spin of the nucleus. They 
represent the tendency of protons or neutrons in like states in the nucleus 
to “pair off” with opposite angular momenta; the binding energy of such 
pp and nn pairs is of the order of 1 or 2 MeV. r 

This phenomenon has, in particular, the result that, if the nucleus contains 
even numbers of both protons and neutrons (an even—even nucleus), then 
the angular momenta of all the nucleons balance in pairs, so that the total 
angular momentum of the nucleus ts Zero. 

If the nucleus contains an odd number of protons or neutrons, however, 
with all nucleons outside closed shells being in like states, the total angular 
momentum of the nucleus is usually equal to that of one nucleon, as if a 
single nucleon were left over after the pairing of all possible pairs of protons 
and of neutrons (the total angular momenta of complete shells being 
necessarily zero). 

For odd-odd nuclei (Z odd and N odd) there is no sufficiently general 
rule to determine the spin of the ground state. 

A discussion of the actual manner in which the shells are filled in nuclei 
would require a detailed analysis of the available experimental results, and is 
outside the scope of this book. Here we shail add only some general remarks. 

In studying the properties of atoms we have seen that their electron states 
can be divided into groups such that the binding energy of the electron 


decreases as each group is completed and the next is begun. A similar 
situation occurs for nuclei, the nucleon states being distributed among the 
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following groups: 


Nucleons 
1sy/2 Z 
13/2; 121/2 6 
1ds,2, 1d3;2, 251/2 12 > (118.8) 
Lf7j2, 2paj2, Ifsye, 2Pu2, 189/2 30 | 
2d5;2, 1g7/2, Lhi1/2, 243/2, 351/2 32 
2f7/2, 1hg 2, 1113/2, 2fs2, 3p3/2, 3prye 44 


For each group the total number of proton or neutron vacancies is shown. 
According to these numbers the occupation of a group is completed when the 
total number Z of protons or N of neutrons in the nucleus is equal to one of 
the numbers 

Z,, B20, 50,82, 120. 


These are commonly called magic numbers.t 

The “doubly magic” nuclei, in which both Z and N are magic numbers, 
are particularly stable. In comparison with adjacent nuclei they have an 
unusually small affinity for a further nucleon, and their first excited states are 
unusually high.f 

The various states in each of the groups (118.8) are listed in approximate 
order of successive occupation in the series of nuclei. In reality, however, 
considerable irregularities are observed in the occupation process. Moreover, 
it must be borne in mind that, in heavy nuclei not close to the magic numbers, 
the distances between the various levels may be comparable with the 
“pairing energy”, and the concept of individual states of components of a 
pair is then itself largely meaningless. 

We may make some comments regarding the calculation of the magnetic 
moment of the nucleus in the shell model. By this we mean, of course, the 
magnetic moment averaged with respect to the motion of the particles in the 
nucleus. This mean magnetic moment is evidently in the direction of the 
nuclear spin J, which is the only preferred direction in the nucleus; its 
operator is therefore 


Ê = pog), (118.9) 


where po is the nuclear magneton and g the gyromagnetic factor. The 
eigenvalue of the projection of this moment is pz = pogMy. Usually 
(cf. (111.1)) the magnetic moment p of the nucleus is taken to be simply 
the maximum value of its projection, i.e. p = pogJ. In this notation 


+ The states 1f,,. (8 vacancies) are sometimes placed in a separate group, since the number 
28 also has some magic properties. 


t These nuclei include Hes, sO,, Cazo, Pbi; the *He nucleus is incapable of adding 
another nucleon. 
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The magnetic moment of the nucleus is composed of the magnetic moments 
of the nucleons outside closed shells, since the moments of nucleons in 
completed shells cancel out. Each nucleon produces in the nucleus a magnetic 
moment which consists of tvo parts: a spin part and (in the case of the 
proton) an orbital part, i.e. is represented by the sum g,8+ g7l. (Here and 
henceforward we omit the factor po, assuming, as is usual, that magnetic 
moments are measured in units of the nuclear magneton.) The spin and 
orbital gyromagnetic factors are g; = l, gs = 5-585 for the proton and g; = 0, 
gs = — 3-826 for the neutron. 

After averaging with respect to the motion of the nucleon in the nucleus, 
its magnetic moment becomes proportional to j; writing it in the form gy, 
we have 


gj = gs8 tel 
= Ker+gs)j +He—gs)(l—8). 


Multiplying both sides of this equation by J = 1+ 8 and taking eigenvalues, 
we obtain 


gait 1) =HartEss+ 1) +lees 1)—ss+ 1], 
and, putting s = 4,7 =/+}, 


Le Bi 
for; = l+}. 118.11 
21+1 CLs ne ( ) 





BISANE 


With the above values of the gyromagnetic factors, this gives for the magnetic 
moment of the proton py = gy 


2:29\ . i 
wali ey teint 
jtl 


(118.12) 
up =j+2-29 forj = 1+4, 
and for that of the neutron 
1-91. f 
En =- for j = I—ł, 
jt+l 
(118.13) 
un = —1-91 forj =1+4 


(T. Schmidt 1937). 

If there is only one nucleon outside the closed shells, formulae (118.12) 
and (118.13) give directly the magnetic moment of the nucleus. For two 
nucleons, the addition of their magnetic moments is also elementary (see 
Problem 1). When the number of nucleons exceeds two, the averaging of the 
magnetic moment must be effected by means of the wave function of the 
system, constructed in the appropriate manner from the wave functions of 
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the individual nucleons. If the nucleon configuration and the state of the 
nucleus as a whole are given, the wave function can be constructed uniquely 
in cases where only one state of the system with the given values of J and T 
can correspond to the given configuration (see, for example, Problem 3); 
otherwise, the state of the nucleus is a mixture of several independent 
states (with the same J and 7), and in general the coefficients in the linear 
combination which gives the wave function of the nucleus remain unknown.t 
Finally, we may mention that the existence of spin-orbit coupling of 
nucleons in the nucleus leads to the appearance of a certain magnetic moment 
of the protons in the nucleus, additional to (118.9) (M. Géppert-Mayer and 
J. H. D. Jensen 1952). The reason is that, when the interaction operator 
depends explicitly on the velocity of the particle, the case where an external 
field is present is obtained by replacing the momentum operator p by p—eA/c. 
Carrying out this replacement in (118.3) and using the expression (111.7) 
for the vector potential, we find that the Hamiltonian of the proton contains 

an additional term 
e 





ġ(r)—n x A.ŝ =f(r) FFA x (H x r).§ 


cmp 


e 
Sees x (§ x r).H. 


This term is equivalent to the appearance of an additional magnetic moment 
whose operator is 
e 


2C 





aaa = — 7 f(r)r x (xr) 





= rf(7){8—(8.n)n}. (118.14) 


PROBLEMS 


PROBLEM 1. Determine the magnetic moment of a system of two nucleons (with total 
angular momentum J = jı +ja), expressing it in terms of the magnetic moments py and pa 
of the two nucleons. 


SOLUTION. Similarly to the derivation of formula (118.11) we obtain 


H 2 2\(a—s2tj2+ 1 
5-548) 4(2-2)% XJ+) ) 
hn pe Aap J(4+1) 


PROBLEM 2. Find the possible states of a system of three nucleons with angular momenta 
J = 3;2 (and the same principal quantum numbers). 


SOLUTION. We proceed as in §67 when finding the possible states of a system of equivalent 
pe Each nucleon can be in one of eight states with the following pairs of values 
of (#3, Tr): 

(3/2, 1/2), (1/2, 1/2), (—1/2, 1/2), (—3/2, 1/2), 
(3/2, —1/2), (1/2, ~1/2), (~1/2, ~1/2), (—3/2, —1/2). 





t Note, however, that the “single-particle” calculation of nuclear magnetic moments is in 


practice fairly inaccurate. The pairs of values (118.12) and (118.13) are upper and lower 
limits rather than exact values of the moments. 
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Combining these states in groups of three different ones, we find the following pairs of 
values of (My, Tr).for the system of three nucleons: 
(772, 1/2), 2(5}2, 1/2), (372, 42); H2 TAZ). (172, 30y SCAf2,.. N2). 
(The number before the parenthesis indicates the number of such states; states with negative 
values of My and Tr need not be written out.) These correspond to states of the system with 
the following values of (J,7): 
(752; Ad2), (S25 192), G Se) es 2), T2 


PROBLEM 3. Determine the magnetic moment of the ground state of a configuration of 
two neutrons and one proton in ps; states (with the same n), taking account of isotopic 
invariance.t 


SOLUTION. The ground state of such a configuration has J = 3/2, and from the rule 
given in the text its isotopic spin has the minimum possible value T = |7;/ = 4. 

Let us determine the wave function of the system corresponding to the greatest possible 
value My = 3/2. This value can occur (when Pauli’s principle for two like nucleons is applied) 
for the following sets of values of mj for the nucleons p,n,n respectively: 

(3/2, 3/2, ~3/2), (3/2, 1/2, —1/2), (1/2, 3/2, —1/2), (—1/2, 3/2, 1/2). 


; DM. ; ees ¢ 
Hence the required wave function #777 is a linear combination of the form 


v 
kg 


yaz — aiae y e] N bye? 12 jy 


2-1/2 f2 =l} —L/2' — 1/2 
Sre lie —li2 eo fe 
+e nee elt ae TY teh (1) 


where [...] denotes the normalized antisymmetrized product (i.e. a determinant of the form 
(61.5)) of the wave functions ¢7/ of the individual nucleons. 


The function (1) must vanish under the action of the operators 


3 3 
T- = > Pa and jh = > jh; 
f«] i=l 


see §67, Problem. The operators 7!) convert the proton function of the ith nucleon to the 
neutron function, an” the latter to zero. It is therefore easily seen that the operator T- reduces 
the first term in (1) to a determinant with two identical rows, i.e. to zero, while the deter- 
minants in the three remaining terms become equal; thus we haye the condition 
b+c+d = 0. 

Next, for a single nucleon with j = 3/2 and various values of mj; we have, according to 


(27.12), 


jay? = 0, Jp VI, fay V2 = YN, jay 3/2 = yI, 
Hence we see that the action of the operator J, on the function (1) gives 


Oe aN 3(at+b—c) [ye See nlt 


+2e— AER OBE 


the change in sign of some terms is due to interchanging the rows of the determinant. The 
conditions for this expression to vanish are 


a+b-—-c=0, c—d=0, 
Together with the normalization condition for the function (1), these relations give 
a= 3/15, b= —2/, ls, c=d= 1/4/15. 


Since the mean value of the magnetic moment component of the proton (or neutron) in 
a state with given my is ppmy;j (or pnmy'j), we find that the mean value of the angular momen- 


+ The nucleus Li‘ has this configuration (outside the closed shell (15,,/2)‘). 
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tum of the system calculated by means of the wave function (1) 1s 
9 + 


p= pz = —Hp + — Hpt —{}np+ Fin) + 
15 15 15 


H 4 l 
e E E AN) 
15 3 15 


From formulae (118.12), (118.13) it follows that, for a nucleon in the py; state, pn = —1-91 
and pp = 3-79. Thus p = 3:03. 


PROBLEM +. Determine the magnetic moment of a nucleus in which all nucleons outside 
closed shells are in like states and the number of protons is equal to the number of neutrons. 


SOLUTION. Since, for N = Z, the component Tr of the isotopic spin is zero, diagonal 
matrix elements occur only for the isotopic-scalar part of the operator 


k = > gnint > Bri’ 
n P 


see the end of §116. Separating this part in accordance with formula (116.5), we find that 
it is 
Menten) j= Ngn tgn]. 


n.p 
The total mean magnetic moment of the nucleus is therefore (gn +gp)J. 


PROBLEM 5. Calculate the additional magnetic moment of a nucleon with angular momen- 
tum j, expressing it in terms of the spin-orbit splitting (118.6) (M. Göppert-Mayer and 
J. H. D. Jensen 1952). 

SOLUTION. The averaging of the angular part of the operator (118.14) (the expression in 


braces in that formula, which we denote by @) is effected by means of the formula derived 
in §29, Problem. The result is 


CONS (8.11 — (8.1) -37+ 1)8 
Ti+ Geass) a 


After complete averaging with respect to the motion of the nucleon, the mean value of o 
can only be in the direction of j, i.e. ¢ = aj, whence a = ĝ.);j?. Taking the component of 
the vector (2) in the direction of j (noting that the operator} commutes with 1.8), and taking 
eigenvalues of the quantities l.s, 1? etc., we easily find the following expression for the addi- 
tional magnetic moment of the nucleon (in units of the nuclear magneton): 
mR? fpi 
Haad = +f(r) 
i Re 4 +1) 








forj = 1+}, (3) 


where mp is the nucleon mass and R the radius of the nucleus. In the averaging of r2f, the 
factor r? is replaced by R? owing to the rapid decrease of f(r) inside the nucleus. The mean 


value f in (3) can be expressed in terms of the spin-orbit splitting by means of (118.6). 


§119. Non-spherical nuclei 


A system of particles in a spherically symmetric field cannot have a 
rotational energy spectrum; in quantum mechanics, the concept of rotation 
has no meaning for such a system. This applies to the shell model of the 
nucleus with a spherically symmetric self-consistent field considered in 
§117. 

The division of the energy of the system into “internal” and “rotational” 
parts has no precise meaning in quantum mechanics. It can only be approxi- 
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mate and is possible where, for physical reasons, the consideration of the 
system as an assembly of particles, moving in a given field which is not spheri- 
cally symmetric, is a good approximation. The rotational structure of the 
levels is then a consequence of taking into account the possibility of rotating 
this field with respect to a fixed system of coordinates. Such a case occurred, 
for example, in molecules, whose electron terms can be determined as the 
energy levels of a system of electrons moving in a given field of fixed nuclei. 

Experiment shows that the majority of nuclei in fact have no rotational 
structure. This means that the spherically symmetric self-consistent field 
is a good approximation for such nuclei, i.e. they are spherical in shape 
apart from quantum fluctuations. 

There exists also, however, a class of nuclei which have an energy spectrum 
of the rotational type; they lie approximately in the ranges of atomic weight 
150 < A < 190and A > 220. This property means that the approximation 
of the spherically symmetric self-consistent field 1s entirely inapplicable to 
such nuclei, and for them the self-consistent field must in principle be sought 
without any initial assumptions regarding its symmetry, in order that the 
shape of the nucleus should also be “‘self-consistently’”’ determined. Experi- 
ment shows that a correct model for nuclei of this type is given by a self- 
consistent field having an axis of symmetry and a plane of symmetry per- 
pendicular to it (i.e. having the symmetry of a spheroid). The concept of 
non-spherical nuclei has been most extensively developed in the work of 
A. Bohr and B. R. Mottelson (1952-3). 

It should be emphasized that we are concerned here with two qualitatively 
different classes of nuclei. This is seen, in particular, from the fact that 
nuclei are either spherical or else non-spherical with a “degree of deforr.a- 
tion” that is not small. - 

The occurrence of non-sphericity is favoured by the presence of incomplete 
shells in the nucleus, and the phenomenon of nucle: pairing also appears to 
be of considerable importance here. Closed shells, on the other hand, tend 
to give spherical nuclei. A characteristic example is the doubly magic nucleus 
Pb: owing to the marked completeness of its nucleon configuration, this 
nucleus (and also those adjoining it) is spherical, and this brings about 
a gap in the sequence of non-spherical heavy nuclei. 

The energy levels of a non-spherical nucleus consist of two parts: the levels 
of the “fixed” nucleus and the energy of its rotation as a whole. In even—even 
nuclei the intervals of the rotational structure of the levels are small in com- 
parison with the distances between the levels of the ‘‘fixed’’ nucleus. 

The classification of the levels of a non-spherical nucleus is in many ways 
similar to that for a diatomic molecule consisting of like atoms, since the 
symmetry of the field in which the particles (nucleons or electrons) move is 
the same in each case. We can therefore apply directly a number of the results 
obtained in Chapter XI.t 


+ It must be emphasized that we are referring to the analogy with the classification of 
levels of the diatomic molecule, not of the symmetrical top. For a system of particles moving 
in an axially symmetric field, the concept of rotation about the field axis has no meaning, like 
that of rotation about anv axis for a system in a centrally symmetric field. 
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Let us first consider the ciassification of states of the “fixed” nucleus. 
In a field with axial symmetry, only the component of the angular momentum 
along the axis of symmetry is conserved. Each state of the nucleus is there- 
fore described first of all by the value Q of the component of its total angular 
momentum,f which can be either integral or half-integral. The levels are 
described as even (g) or odd (u) according to the behaviour of the wave 
function when the coordinates of all the nucleons (with respect to the 
centre of the nucleus) change sign. 

In addition, for Q = 0 positive and negative states are distinguished, 
according to the behaviour of the wave function on reflection in a plane 
passing through the axis of the nucleus (see §78). 

The ground states of even-even non-spherical nuclei are OF (the zero 
indicating the value of Q), corresponding to zero angular momentum and the 
highest symmetry of the wave function. This is a result of the pairing of all 
the neutrons and protons. If the nucleus contains an odd number of protons 
or neutrons, however, we can consider the state of the “odd” nucleon in the 
self-consistent field of the even-even remainder of the nucleus. Here the 
value of {2 is determined by the component w of the angular momentum of this 
nucleon. Similarly, in an odd—odd nucleus the value of Q is obtained from the 
angular momentum components of the odd neutron and proton: 


Q = | wotan| P 


It should be emphasized at the same time that we cannot speak of definite 
values of the components of the orbital angular momentum and spin of the 
nucleon. The reason is that, although the spin-orbit coupling of the nucleon 
is small in comparison with the energy of its interaction with the self- 
consistent field of the remainder of the nucleus, it is not small compared 
with the distances between adjoining energy levels of the nucleon in that 
field, as it would have to be for perturbation theory to be applicable, so that 
the orbital angular momentum and the spin of the nucleon could, to a good 
approximation, be considered separately. t 

Let us now consider the rotational structure of a non-spherical nucleus. 
The intervals in this structure are small compared with the spin-orbit 
interaction of the nucleons in the nucleus. This corresponds to case a in 
the theory of diatomic molecules (§83). 

The total angular momentum J of a rotating nucleus is, of course, con- 
served. For given Q its magnitude J takes values from Q upwards: 


Tet, O81, ue? (119.1) 


see (83.2). An additional restriction on the possible values of J occurs for 





t By definition. Q>0 (just as the quantum number A is positive for diatomic molecules). It 
may be recalled that negative values of Q for diatomic molecules could arise only because Q 
was defined as the sum A+, and E can be either positive or negative, depending on the 
relative directions of the orbital angular momentum and the Spin. 


t In spherical nuclei this was still possible, owing to the simultaneous conservation of parity 
and angular momentum. 
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nuclei with Q = 0: in states Og and Ou the number J takes only even values, 
and in states Og and Ou only odd values (see §86). In particular, in the rota- 
tional levels of the ground term for even-even nuclei (07) the number J 
takes the values 0, 2, 4, .... 
The rotational energy of the nucleus is given by the formula 
k2 
Erot = HJ+1) (119.2) 


where I is the moment of inertia of the nucleus (about an axis perpendicular 
to its axis of symmetry); this formula corresponds to the similar expression 
in the theory of diatomic molecules (the term depending on J in (83.6)). 
The lowest level corresponds to the least possible value of J, i.e. J =Q. 

On account of (119.2) the rotational structure of the levels is described by 
certain interval rules which do not depend on the other characteristics of 
the level (for given Q). For instance, the components of the rotational 
structure of the ground term of an even-even nucleus (with J = 2, 4, 6, 8, ...) 
are at distances in the ratio 1:3-3:7:12... from the lowest level (J = 0). 

Formula (119.2), however, is insufficient for states with Q = à, which 
can occur in nuclei with an odd number of nucleons. In this case there is a 
contribution to the energy, comparable with (119.2), due to the interaction 
of the odd nucleon with the centrifugal field of the rotating nucleus. Its 
dependence on J can be found as follows. 

It is known from mechanics (Mechanics, §39) that the energy of a particle 
in a rotating coordinate system contains an additional term equal to the 
product of the angular velocity of rotation and the angular momentum of the 
particle. The corresponding term in the Hamiltonian of the nucleus can be 
written in the form 26K.é, where b is some constant, K the angular momen- 
tum of the remainder of the nucleus (excluding the last nucleon), and o the 
angular momentum of that nucleen. Here the latter must be understood in a 
purely formal sense; in reality, the angular momentum vector of the nucleon 
does not exist in the axial field of the nucleus. This sense is that of an 
operator analogous to the operator of spin 3, which gives transitions between 
states with values of the angular momentum component + 3, in accordance 
with the value Q = }.+ Since K = J —o, the eigenvalues of this operator are 


2bK . o = bf J(J+1)—K(K+1)—3]. 


Adding for convenience the constant $b, which is independent of J, we find 
that this quantity equals + b(J+4) when J = K+. 

This expression can be written (—1)¥~# b(J + +) if we use the fact that the 
angular momentum K of the even—even remainder of the nucleus is even. 


+ The specific property of the case Q = $ consists precisely in the existence of matrix 
elements of the energy perturbation for transitions between states differing only in the sign of 
the angular momentum component and therefore belonging to the same energy. This brings 
about a shift in energy even in the first approximation of perturbation theory. l ' 

The phenomenon concerned is analogous to the A-doubling of the levels of a diatomic 
manleni e with Q = 4 (§88). 
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Thus we have finally the following expression for the rotational enérgy of 
the nucleus with Q = ł4: 


hie 


Erot = J+ 1)+(— 112 +4) (119.3) 


(A. Bohr and B. R. Mottelson 1953). Note that, if the constant b is positive 
and sufficiently large, the level with J = 3/2 may lie below that with J = 4, 
i.e. the normal order of rotational levels (where the lowest level corresponds 
to the smallest possible value of J) will be altered. 

The moment of inertia of a non-spherical nucleus cannot be calculated as 
that of a solid of given shape. Such a calculation would be possible only if 
the nucleons moving in the self-consistent field of the nucleus could be 
regarded as not directly interacting. In reality, the pairing effect leads to a 
reduction in the moment of inertia, in comparison with the value for a rigid 
body. 

The magnetic moment p of a non-spherical nucleus consists of the magnetic 
moment of the “fixed” nucleus and that due to the rotation of the nucleus. 
The former (after averaging over the motion of the nucleons in the nucleus) 
is along the axis of the nucleus; denoting its value by p’, and the unit vector 
along the axis of the nucleus by n, we can write it in the form p’n. The mag- 
netic moment due to the rotation is (after the same averaging) along the 
vector J—Qn, the total angular momentum of the nucleus minus that of the 
nucleons in the “fixed nucleus”.t Thus 


B= pntg(J—On). (119.4) 


Here g, is the gyromagnetic factor for the rotation of the nucleus. Since the 
contribution to the magnetic moment in rotation comes only from the protons, 
we have 


&r = [p/p +n), (119.5) 
where In and Ip are the neutron and proton parts of the moment of inertia 
of the nucleus; for a system of protons only, gy = 1 simply. The ratio 


(119.5) is in general not the same as the ratio Z/A of the number of protons 
to the total mass of the nucleus. 


After averaging over the rotation of the nucleus, the magnetic moment 
is in the direction of the conserved vector J: 


Ê = pJ/J = (n’—Og,)i+g,J. 


As usual, we multiply both sides of this equation by J and take eigenvalues. 
In the ground state of the nucleus Q = J, and so 


w= (n' +8) +1). (119.6) 


+ This formulation can be used only if (2443 (see Problem 2), 
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PROBLEM 1. Express the quadrupole moment Q of a rotating nucleus in terms of the 
quadrupole moment Qo relative to axes fixed to the nucleus (A. Bohr 1951). 


SOLUTION. The operator of the quadrupole moment tensor of a rotating nucleus is given 
in terms of Qo by 


Qir = 8Qo( nen — $ò); 


this is a symmetrical tensor with zero trace, formed from the components of the unit vector 
n along the axis of the nucleus, and Qzz = Qo. The averaging with respect to the rotational 
state of the nucleus is effected similarly to the solution in §29, Problem (with the difference 
that nJ; =, not zero), and leads to an expression of the form (75.2) with 


3Q2—J(J+1) 
~ e TENE) 
For the ground state of a nucleus with 2 = J we obtain 
(2J—1)J 
A QI+3\ F410) 


As J increases, the ratio Q/Qo tends to 1, but only slowly. 


Q 


PROBLEM 2. Determine the magnetic moment in the ground state of a nucleus with 
Q = 4. 


SoLUTION. In this case the magnetic moment operator can be written by means of the 
operator 6 introduced in the text, in the form 


pL = 2y'S+2,K, K = j—@. 


The subsequent calculation is similar to that in the text. If the value J = 4 corresponds to 
the ground level of the nucleus (and K = J —}4 = 0), we have p = p’; if in the ground state 
J = 3/2 (and K = J+4 = 2), then p = (Sgr —3n')/3. 


PROBLEM 3. Determine the energies of the first few levels of the rotational structure of 
the ground state of an even-even nucleus having ellipsoidal symmetry. 


SoLUTION. The ground state of an even-even nucleus corresponds tothe most symmetrical 
wave function of the “fixed” nucleus, i.e. the function whose symmetry corresponds to the 
representation A of the group D2. There are therefore altogether 4J+1 (for even J) or 
4(J—1) (for odd J) different levels for a given value of J. For J = 2 they are given by formula 
(7) in §103, Problem 3, and for J = 3 by formula (8) in §103, Problem 4. 


§120. Isotopic shift 


The specific properties of the nucleus (finite mass, dimensions, spin) 
which distinguish it from a fixed point centre of a Coulomb field have a 
certain influence on the electron energy levels of the atom. 

One such effect is called the isotopic shift of levels, that is, a change in the 
energy of a level from one isotope of an element to another. In practice, 
of course, what is of interest is not the change in energy of one level but the 
change in the distance between two levels observed as a spectral line. For 
this reason we must in practice consider not the energy of the entire electron 
envelope of the atom but only the part due to the electron involved in the 
transition in question. 

In light atoms the isotopic shift is due mainly to the finite mass of the 
nucleus. When the motion of the nucleus is taken into account a term 


1 2 
(Zê) 
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appears in the Hamiltonian, where M is the mass of the nucleus and the pz 
are the momenta of the electrons.t The isotopic shift due to this effect is 
therefore given by the mean value 


a( = i: =) (Sp). (120.1) 
4 


calculated from the wave function of the relevant state of the atom (Mı and 
Mp being the masses of the nuclei of the isotopes). 

In heavy atoms the main contribution to the isotopic shift comes from the 
finite size of the nucleus. This effect is in practice appreciable only for the 
levels of an outer electron in the s state, since the wave function of the s state 
(unlike those of states with / 4 0) does not vanish as r 0, and so the 
probability of finding the electron “within the nucleus” is comparatively 
large. We shall calculate the isotopic shift for this case.] 

Let ¢(r) be the true electrostatic potential of the field of the nucleus, as 
opposed to the potential Ze/r of the Coulomb field of a point charge Ze. 
Then the change in the electron energy in comparison with its value in a 
purely Coulomb field Ze/r is given by the integral 





AE =—e Í ($— Zelry(rdV, (120.2) 


where (r) is the electron wave function; in the s state this function is 
spherically symmetric and real. Although the integration here is formally 
extended to all space, in practice the difference $— Ze/r in the integrand is 
zero except within the nucleus. The wave function of the s state tends to a 
constant limit as z -> 0 (see §32), and this constant value is practically reached 
even outside the nucleus. We can therefore take 4? outside the integral and 
replace y(r) by its value at r = 0, calculated for the Coulomb field of a point 
charge. 


For a further transformation of the integral we use the identity Ar? = 6 
and write (120.2) as 


AE = —}e?(0) f (d—Ze/r)/Ar?2.dV 


= — feys?(0) | r2/\(¢— Zelr)dV; 


in transforming the volume integral we have used the fact that the resulting 





+ In the centre-of-mass system of the atom, the sum of the momenta of the nucleus and the 
electrons is zero: Pnuet+ Zp: = 0. Their total kinetic energy is therefore 


53 
Pouc“ 


1 f 1 2 1 
2M Im aei (DP) +z, OPA 


t The calculation given below does not take account of relativistic effects in the motion of 
the electron near the nucleus, and is valid only if the condition Ze?/fic<€1 holds. 
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integral over an infinitely remote surface is zero. But A(1/r) = —4ròô(r), 
and 728(r) = 0 for all r. According to the electrostatic Poisson’s equation, 
Ad = —4np, and in this case p is the density of the electric charge distribu- 
tion in the nucleus. The final result is 


AE = 2ans2(0)Ze?r?, (120.3) 


where 
72 = (1/Ze) | pr?d V 


is the proton mean square radius of the nucleus; for a uniform distribution 
of protons in the nucleus, r? = 3R?/5, where R is the geometrical radius of 
the nucleus. The isotopic shift of the level is given by the difference of the 
expressions (120.3) for the two isotopes. 

In §71 an estimate has been given of (0), and it was shown to depend on 
the atomic number (assumed large) as V/Z. Hence the splitting (120.3) is 
proportional to R222. 


§121. Hyperfine structure of atomic levels 


Another effect in atoms due to the properties of the nucleus is the splitting 
of atomic energy levels as a result of the interaction of electrons with the 
spin of the nucleus. This is called the hyperfine structure of the levels. On 
account of the weakness of this interaction the intervals in the hyperfine 
structure are very small, even in comparison with those in the fine structure. 
Hence the hyperfine structure must be considered separately for each com- 
ponent of the fine structure. 

The spin of the nucleus will be denoted in this section (in accordance with 
the notation usual in atomic spectroscopy) by 2, the notation J being retained 
for the total angular momentum of the electron envelope of the atom. The 
total angular momentum of the atom (including the nucleus) is denoted 
by F = J+i. Each component of the hyperfine structure is described by a 
definite value of this angular momentum. According to the general rules for 
addition of angular momenta, the quantum number F takes the values 


Passi, J+i=1,55|I=2], (121.1) 


so that each level with given J is split into 27+ 1 components ie < J, or 
2J+1ifi > J. 

Since the mean distances 7 between the electrons in the atom are large 
compared with the radius R of the nucleus, an important part in the hyperfine 
splitting is played by the interaction of the electrons with the lowest-order 
multipole moments of the nucleus. These are the magnetic dipole and electric 
quadrupole moments; the mean dipole moment is zero (see §75). 


The magnetic moment of the nucleus is of the order of Hnuc ~ eRvpuclC, 
where Unuc are the velocities of the nucleons in the nucleus. The energy of 
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its interaction with the magnetic moment of the electron (ue) ~ efi/mc) is of 
the order of 


MnucHel eh RYnuc 


Mr 








a ay 

r mce rè ( 
The quadrupole moment O ~ eR?; the energy of interaction of the field 

which it produces with the charge on the electron is of the order of 


eQ|r3 ~ e223, (121.3) 


Comparison of (121.2) and (121.3) shows that the magnetic interaction (and 
therefore the resulting splitting of the levels) is (vnuc/c) (A/mcR) ~ 15 times 
greater than the quadrupole interaction; although the ratio vnue/c is 
relatively small, the ratio A/mcR is large. 

The operator of the magnetic interaction of the electrons with the nucleus 
is of the form 


Pu =ai J (121.4) 


(similarly to the spin—orbit interaction of the electrons (72.4)). The depend- 
ence of the resulting splitting of the levels on F is therefore given by 


,aF(F +1); (121.5) 
cf. (72.5). 

The operator of the quadrupole interaction of the electrons with the nucleus 
is constructed from the operator Qix of the quadrupole moment tensor of 
the nucleus and the components of the angular momentum vector J of the 
electrons. It is proportional to the scalar OixJ; J; formed from these operators, 
i.e. has the form 


biii + iets — 2ili+ DEl ufc: (121.6) 


here we have used the fact that Qix is given in terms of the nuclear spin 
operator by a formula of the type (75.2). On calculating the eigenvalues of 
the operator (121.6) (in a manner entirely similar to the calculations in §84, 
Problem 1), we find that the dependence of the quadrupole hyperfine splitting 
of the levels on the quantum number F is given by the expression 


OFF +1)2+46F(F +1)[] -2/(J+1)—2i(i+ 1)}. (121.7) 


The magnetic hyperfine splitting effect is especially noticeable for levels 
due to an outer electron in the s state, owing to the comparatively high 
probability that such an electron will be near the nucleus. 

Let us calculate the hyperfine splitting for an atom containing one outer s 
electron (E. Fermi 1930). This electron is described by the spherically 


symmetric wave function ¥(7r) of its motion in the self-consistent field of the 
other electrons and the nucleus.t 


t The following calculation assumes that the condition Ze?/hic<1 is satisfied (cf. the second 
footnote to §120). 
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We shall seek the operator of the interaction with the nucleus as the operator 
— ji.H of the energy of the magnetic moment fi = pi/t of the nucleus in the 
magnetic field H created (at the origin) by the electron. According to a 
well-known formula of electrodynamics, this field is 





ee | AV, (121.8) 


where fis the operator of the current density due to the moving electron spin, 
and r = nr the radius vector from the centre to the element dV.t According 
to (115.4), 


= 2uBC curl (28) 


Come > 
i 


2 
dr 


where jg is the Bohr magneton. Writing dV = 7? drdo and carrying out the 
integration, we find 


Hi 


2 
~2us | dr | nx(nx do 
dr 
0 


= — 2u ?(0) Z8. 
The interaction operator is, finally, 
Pi = -f.H = end %(0} A (121.9) 


If the total angular momentum of the atom J = S = 4, the hyperfine 
splitting leads to the appearance of a doublet (F = 1+4); according to 
(121.5) and (121.9) we find for the distance between the two levels 


Ey — Ei = (877/32) upp( 22 + 1)y2(0). (121 .10) 


Since the value of (0) is proportional to »/Z (see §71), the magnitude of 
this splitting is proportional to the atomic number. 


PROBLEMS 
PROBLEM 1. Calculate the hyperfine splitting (due to magnetic interaction) for an atom 
containing (outside closed shells) one electron with orbital angular momentum # (E. Fermi 


1930). 


+ See Fields, (43.7). In that formula, the vector R is in the opposite direction, from dV 
to the centre (the point at which the field is observed). 
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SOLUTION. The vector potential and the magnetic field strength due to the magnetic 
moment p of the nucleus are 


A= #50 qy 3n(en)—# 
r’ r3 


(div A = 0). Using these expressions, we can write the interaction operator in the form 


EPE: Pe LiiR g = 2H Bo f4 3(S.n)n — $]. 
me me rs 


After averaging over a state with a given value of j, the expressiun in the brackets is in the 
direction of j. We can therefore write 


Pij = Ywwit-j{1.j + 3(8.n)(n.}) -8-7 Yj + 1). 


The mean value of minx has been calculated in §29, Problem. Using this and taking eigen- 
values, we find 
2p Bp» j| -21+ 1)s.į}— 6(s.1)(5.1)] —,., - 
ZEZE j 4 eee pe 1), 
rA ee tale } +1) 


whence, after a simple calculation, we have finally 


pee Ut Dacre yr, 
t j(j+1) 
where F = j +i, and j = +}. The averaging of r-3 is with respect to the radial part of the 


electron wave function. 


PROBLEM 2, Determine the Zeeman splitting of the components of the hyperfine structure 
of an atomic level (S. A. Goudsmit and R. F. Bacher 1930). 


SoLUTION. In formula (113.4) (the field being assumed so weak that the splitting which 
it causes is small in comparison with the hyperfine structure intervals), the averaging must 
be effected not only with respect to the electron state but also with respect to the directions 
of the nuclear spin. From the first averaging we get AE = ppggJsJ:H, with the same gy 
(113.7). The second averaging gives, analogously to (113.5), 


Jz = (J. F)Mr/F*. 
Thus we have finally 


i = Mr, = g 
AR Heer a, BF =g 2F(F +1) 


§122. Hyperfine structure of molecular levels 


The hyperfine structure of the energy levels of molecules is similar to that 
of the atomic levels. 

In the great majority of molecules the total electron spin is zero. The main 
source of the hyperfine splitting of the levels is then the quadrupole inter- 
action of the nuclei and the electrons; here, of course, only those nuclei 
participate in the interaction whose spin 7 is neither 0 nor 4, since otherwise 
their quadrupole moment is zero. 

On account of the comparative slowness of the motion of the nuclei in the 
molecule, the averaging of the quadrupole interaction operator with respect 
to the state of the molecule is effected in two stages: first we must average 


with respect to the electron state for fixed nuclei, and then with respect to 
the rotation of the molecule. 
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Let us first consider the diatomic molecule. The first averaging gives an 
interaction of each nucleus with the electrons that is expressed by an operator 
proportional to the scalar Oixning formed from the operator of the quad- 
tupole moment tensor of the nucleus and the unit vector n along the axis of 
the molecule—the only quantity which determines the orientation of the 
molecule with respect to the direction of the nuclear spin. Since Oi = 0, 
this operator can be written in the form 


bítik(nnk = Tôi) ; ( 1 22: 1 ) 


for a given value of the component i; of the nuclear spin along the axis of the 
molecule, this quantity is bft? — 4:(7+ 1)]. 

When the operator (122.1) is averaged with respect to the rotation of the 
molecule, it is expressed in terms of the operator K of the conserved rotational 
angular momentum. The averaging of the product nny is effected by means 
of the formula derived in §29, Problem (with the vector K in place of 1), 
and the result is 


b 


3 eonen thet ARs fôn K(K+1)]}. (0222 


The eigenvalues of this operator are found in the same way as for (121.6). 
For a polyatomic molecule we obtain in general, instead of (122.1), an 
operator of the form 


bixfiik, (122.3) 
where b; is a tensor with zero trace which is a certain characteristic of the 
electron state of the molecule. After averaging with respect to the rotation of 


the molecule, this tensor is given in terms of the total rotational angular 
momentum J by a formula of the type 


bir = bE Sut Se Si— 8NI + Six}. (122.4) 


The coefficient 6 can in principle be expressed in terms of the components 
of the tensor bix relative to the principal axes of inertia of the molecule 
&,7,¢; Since these axes are fixed in the molecule, the components bgp etc. 
are a property of the molecule and unaffected by the averaging. Let us con- 


sider the scalar bj,J;J,. A calculation using (122.4) gives 
bixJiJy = bI(J+1X4{J+1)-— 1}; (122:3) 


the method is similar to that used in §29, Problem. Expanding the tensor 
product in components along the axes £,7,¢, we obtain 


birJiJe = beg FP + byrde + bee Ie, (122.6) 


where we have used the fact that the mean values of the products J;J, etc. 
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are zero.t The mean values of the squares J? etc. are found, in principle, 
from the wave functions of the corresponding rotational states of the top. 
In particular, for a symmetrical top we have simply 


Jẹ = k; JP, JZ = MII4-1)— FI. 


If the spins of the nuclei are 4, the quadrupole interaction is absent. In 
this case one of the main sources of hyperfine splitting is the direct magnetic 
interaction between the nuclear magnetic moments. The operator of the 
interaction of two magnetic moments u; = 111/21, p2 = poio/ie is given by 


oa 


Sa io — 3(i1-n)(i2.n)]. 


t112 


To calculate the splitting energy, this must be averaged with respect to the 
state of the molecule, as described above. 

When the molecule contains heavy atoms, comparable contributions to the 
hyperfine splitting are given by the direct interaction and by the indirect 
interaction of the nuclear moments through the electron envelope. Formally, 
this interaction is an effect in the second approximation of perturbation 
theory with respect to the interaction of the nuclear spin with the electrons. 
By means of the results of §121 we easily find that the ratio of this effect to 
the direct interaction of the nuclear moments is of the order (Ze?|ħc)}?, and is 
comparable with unity for large Z. 

Finally, some contribution to the hyperfine splitting of molecular levels 
comes from the interaction of the nuclear magnetic moment with the rotation 
of the molecule. The rotating molecule, being a moving system of charges, 
Creates a certain magnetic field, which may be calculated, using the formulae 
of electrodynamics, from the given current density j = pQxr, where p is 
the charge density (of electrons and nuclei) in the molecule at rest, and Q its 
angular velocity of rotation. The magnitude of the level splitting is found as 
the energy of the magnetic moment of the nucleus in this field; the com- 
ponents of the angular velocity of the molecule must be expressed in terms 
of those of its angular momentum (cf. §103). 





t For in a representation where the matrix of one component of J (Jr, say) is diagonal, the 
matrices of the products Je Je, JnJ; have non-zero elements only when the quantum number k 
changes by 1, whereas the wave functions of stationary states of an asymmetrical top include 
functions ¢yx with values of & differing by an even number (see §103). 


CHAPTER XVII 


ELASTICCOELISTONS 


§123. The general theory of scattering 


IN classical mechanics, collisions of two particles are entirely determined 
by their velocities and impact parameter (the distance at which they would 
pass if they did not interact). In quantum mechanics the very wording of the 
problem must be changed, since in motion with definite velocities the concept 
of the path is meaningless, and therefore so is the impact parameter. The 
purpose of the theory is here only to calculate the probability that, as a result 
of the collision, the particles will deviate (or, as we say, be scattered) through 
any given angle. We are speaking here of what are called elastic collisions, 
in which the particles, or the interna! state of the colliding particles if these 
are complex, are left unchanged. l ; 

The problem of an elastic collision, like any problem of two bodies, amounts 
to a problem of the scattering of a single particle, with the reduced mass, in 
the field U(r) of a fixed centre of force.t This simplification is effected by 
changing to a system of coordinates in which the centre of mass of the two 
particles is at rest. The scattering angle in this system we denote by 8. 
It is simply related to the angles $, and $, giving the deviations of the two 
particles in the laboratory system of coordinates, in which the second particle 
(say) was at rest before the collision: 


tan 3, = me sin 8/(m-+-m2cos6), %: =4F(7—86), (123.1) 


where mı, mg are the masses of the particles (see Mechanics, §17). In par- 
ticular, if the masses of the two particles are the same (mı = me), we have 
simply 

91 = 46, d2 = 4(n—8); (123:2) 


the sum $,+$, = $r, i.e. the particles diverge at right angles. 

In this chapter we shall always use (unless the contrary is specifically 
stated) a system of coordinates in which the centre of mass is at rest, and m 
will denote the reduced mass of the colliding particles. 

A free particle moving in the positive direction of the z-axis is described 
by a plane wave, which we take in the form y = ef*2, i.e. we normalize so that 
the current density in the wave is equal to the particle velocity v. The 
scattered particles are described, at a great distance from the scattering 


+ Here we neglect the spin-orbit interaction of the particles (if they have spin). By assuming 
the field to be centrally symmetric, we exclude from consideration also processes such as the 
scattering of electrons by molecules. 
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centre, by an outgoing spherical wave of the form S(®e*r/r, where f(6) is 
some function of the scattering angle 6 (the angle between the z-axis and the 
direction of the scattered particle). This function is called the scattering 
amplitude. Thus the exact wave function, which is asolution of Schrédinger’s 
equation with potential energy U(r), must have at large distances the asymp- 


totic form 
ib ax ekz + f(B)ethr |r. (123.3) 


The probability per unit time that the scattered particle will pass through 
a surface element dS = r?do (where do is an element of solid angle) is 
(2/7?) |f|? dS = v|f|? do.+ Its ratio to the current density in the incident 
wave is 


do = |f(02 do. (123.4) 


This quantity has the dimensions of area, and is called the effective cross- 
section, or simply the cross-section, for scattering into the solid angle do. 
If we put do = 27 sin 6 dé, we obtain for the cross-section 


do = 27 sin 6 | f (0X2 de (123.5) 


for scattering through angles in the range from @ to 6+dé. 

A solution of Schrédinger’s equation for scattering in a central field U(r) 
must evidently be axially symmetric about the z-axis, the direction of the 
incident particles. Any such solution can be represented as a superposition 
of wave functions of the continuous spectrum, corresponding to motion, in 
the field concerned, of particles with given energy f2k2/2m and orbital 
angular momenta having various magnitudes / and zero #-components; 
these functions are independent of the azimuthal angle ¢ round the z-axis, 
i.e. they are axially symmetric. Thus the required wave function has the 
form 


yb = È A:Pi(cos §)Rxi(7), (123.6) 


where the A, are constants and the Rx are radial functions satisfying the 
equation 


l d/ dR, (1+1) 2m 
ey poe ee eee = Q. 123. 
r2 ai dr ) + 72 he U(r) [Re oa 


The coefficients A; must be chosen so that at large distances the function 
(123.6) has the asymptotic form (123.3). We shall show that this implies that 


l ' 
Ay = —(21+ 1)i! exp(i8;), (123.8) 
2k 

Eo a M 
t Itis supposed that the incident beam of particles is defined by a wide (to avoid diffraction 
effects) but finite diaphragm, as happens in actual experiments on scattering. There is there- 
fore no interference between the two terms of the expression (123.3); the squared modulus 

| 4|7 is taken at points where there is no incident wave, 
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where the ô; are the phase shifts of the functions Ry;. This will also solve 
the problem of expressing the scattering amplitude in terms of these phases. 
The asymptotic form of the function Rx: is given by (33.20): 


Ry % — sin (kr — lr + 51) 


2 
r 
Bs 


{(—7)! exp[i(kr + 6:)] —2! exp[ —i(kr + 8))]. 


ir 


Substituting this and (123.8) in (123.6), we obtain the asymptotic expression 
for the wave function in the form 


<> (21+ 1)Pi(cos 6){(—1)!e-#*r + Syet], (123.9) 


© —— 
Li 


y 


with the notation 


Sı = exp(2i8)). (123.10) 


The expansion of the plane wave (34.2), with the same transformation, is 


] AEPA 
mA (21+ 1)Pi(cos 6){(— 1)! +1e-ikr + ettr], 


eikz we 


~~? 


We see that, in the difference y — e**2, all terms containing the factors e—‘*r 
disappear, as they should. For the coefficient of e**7/r in this difference, 1.e. 
the scattering amplitude, we obtain 


1 (z9) 
f(0) = = 2 (214-1)[S;—1]P,(cos 8). (123.11) 


The formula solves the problem of expressing the scattering amplitude in 
terms of the ô; (H. Faxén and J. Holtsmark 1927). 

If we integrate do over all angles, we obtain the total scattering cross- 
section g, which is the ratio of the total probability (per unit time) that 
the particle will be scattered to the probability current density in the incident 


t The problem of recovering the form of the scattering potential from the phases ôr 
(assumed known) is of fundamental interest. This has been solved by I. M. Gel’fand, B. M. 
Levitan and V. A. Marchenko. It is found that, to determine U(r), it is in principle sufficient 
to know 4,(k) as a function of the wave number throughout the range from k = Otok = ©, 
together with the coefficients an in the asymptotic expressions (for r >œ) Rno™ (Gn/r)e is 
(xn = 4/(2m|E,|)/A) of the wave functions of states corresponding to the discrete (negative) 
energy levels En (if any). The determination of U(r) from these data requires the solution of a 
certain linear integral equation. This topic is fully discussed by V. de Alfaro and T. Regge, 
Potential Scattering, North-Holland, Amsterdam, 1965. 
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wave. Substituting (123.11) in the integral 


s = On i | (8)? sin 6 dð, 
0 


and recalling that the Legendre polynomials with different J are orthogonal, 
while 


T 
{ P;?(cos 6) sin @ dé = 2/(2/+-1), 
0 
we have for the total cross-section 
bon ce 
o = D, (2141) sin? (123.12) 
1-0 


Fach of the terms in this sum is a partial cross-section o; for the scattering 
of particles with given orbital angular momentum /. It may be noted that the 
maximum possible value of this cross-section is 


cr, max = (4r/k?)(21+ 1). (123.13) 


Comparing this with formula (34.5), we see that the number of particles 
scattered with angular momentum / may be four times the number of such 
particles in the incident flux. This is a purely quantum effect due to inter- 
ference between the scattered and unscattered particles. 

It will be useful later to employ also the partial scattering amplitudes 
fi, which we define as the coefficients in the expansion 


f(@) = F (21+ 1)fiPi(cos 8). (123.14) 
According to (123.11) these are related to the phases 8; by 
1 l 
= Sı — 1) = —(et— 1), 123.15 
fi z 1—1) aa ) ( ) 
and the partial cross-sections are 
or = 4n(2/+1)| fil. (123.16) 
PROBLEM 


Express the scattering amplitude in terms of the phase shifts in the two-dimensional case. The field 
U=U(p), P =./(x? + 27). The particle flux is in the z-direction. 

SOLUTION. In the two-dimensional case, the wave function far from the scatterer is a super- 
position of a plane wave and an outgoing cylindrical wave: 


Y =e t fepe ip). (1) 
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Here, ¢ is the angle between the z-axis and the direction of scattering, f (@) the scattering amplitude, 
which in the two-dimensional case has the dimensions of square root of length. The factor 
—i = exp(—}im) under the root sign is included in order to simplify the subsequent formulae. The 
scattering cross-section per unit length in the y-direction is 


do = | f|? dọ. 


This has the dimensions of length- 

The wave function is to be expanded in terms of functions having a definite -component m of 
the angular momentum, in the form Q..(p)je™. At large distances from the scatterer, the radial 
fanctions differ from those found in §34, Problem, for free motion only by a phase shift: 


Q, (p) =i" J (Zinkp) sin[kp — hx (m — 5) + òn]; 
with 6, =6_,. Repeating the arguments of §123 and using the plane wave expansion from §34, 


Problem, we find that the function having the asymptotic form (1) is given by the series 


co 


w= $ R, (ple, 


m= co 
and the scattering amplitude is 


] co 


a Bon ime 
S(O) = oak) &, © hier (2) 


Integration gives the total cross-section 


2n 


o= [iag = $ On, where 0, = O_» = (4k) sin? 6,,. 


0 


It is easily seen that 


im f (0) = /(h/87)0, (3) 


which is the optical theorem for the two-dimensional case; see (125.9). 


§124. An investigation of the general formula 


The formulae which we have obtained are in principle applicable to 
scattering in any field U(r) which vanishes at infinity. The use of these 
formulae involves only an examination of the properties of the phases ôr 
which appear in them. 

To estimate the order of magnitude of the phases 6, for large values of l, 
we use the fact that the motion is quasi-classical for large J (see §49). Hence 
the phase of the wave function is determined by the integral 


where r, is a zero of the expression under the radical (r > ro being the classi- 
cally accessible region of motion). Subtracting from this the phase 


i Pa 
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of the wave function of free motion, and letting r > œ, we obtain, by 
definition, the quantity ô. For large values of /, the value of rọ also becomes 
large; U(r) is therefore small throughout the range of integration, and we 
have approximately 


ô = — frue) dr/ře | [e-+] (124.1) 
r 
fFe 
In order of magnitude this integral (if convergent) is 
5; ~ mU(ro)ro/kh?. (124.2) 


The order of magnitude of 7, is rọ ~ Lk. 

If U(r) vanishes at infinity as i/r” withn > 1, the integral (124.1) converges, 
and the phases 6, are finite. On the other hand, for n < 1 the integral 
diverges, so that the phases ô, are infinite. This holds for any ¿, since the 
convergence or divergence of the integral (124.1) depends on the behaviour 
of U(r) for large 7, while at large distances (where the field U(r) is weak) 
the radial motion is quasi-classical for all J. We shall show below how the 
formulae (123.11), (123.12) are to be interpreted when 6, is infinite. 

Let us first consider the convergence of the series (123.12) which gives the 
total scattering cross-section. For large /, the phases 6; < 1, as is seen from 
(124.1) if we take into account the fact that U(r) decreases more rapidly 
than 1/r. Hence we can put sin26; = 6,2, and so the sum of the high terms 
in the series (123.12) will be of the order of 2er. From the well-known 


integral test for the convergence of series, we conclude that the series in 


question converges if the integral f /5;2d/ does so. Substituting here (124.2) 
and replacing / by kro, we obtain the integral 


co 
{ U 2(ro)ro° dro. 


If U(r) decreases at infinity as 1/r” with n > 2, this integral converges, 
and the total cross-section is finite. If, on the other hand, the field U(r) 
decreases not more rapidly than 1/72, the total cross-section appears to be 
infinite. The physical reason for this is that, when the field falls off only 
slowly with distance, the probability of scattering through small angles 
becomes extremely large. In this connection we may recall that, in classical 
mechanics, in any field which vanishes only as y > œ, a particle passing 
at any finite impact parameter p, however large, always undergoes a deviation 
through some angle which, though small, is not zero; hence the total 
scattering cross-section is infinite for anv law of decrease of U(r).t In 
quantum mechanics, this argument is invalid, since we can speak of scat- 
tering through a certain angle only if this angle is large compared with the 
indeterminacy in the direction of motion of the particle. If the impact 
parameter is known to within Ap, an indeterminacy f/Ap is caused in the 


t This is seen from the divergence of the integral | 27p dp which gives the total cross- 
section in classical mechanics. 
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transverse component of momentum, i.e. an indeterminacy ~ h]mvAp in 
the angle. 

In view of the important part played by small-angle scattering when U(r) 
decreases only slowly, the question naturally arises whether the scattering 
amplitude (6) diverges for 6 = 0, even when U(r) decreases more rapidly 
than 1/r2. Putting 6 = 0 in (123.11), we obtain for the high terms in the sum 
an expression proportional to 2 ð „ Arguing as in the previous case, our 


search for the criterion of the convergence of the sum leads us to the integral 


(ea) 


i U(ro)ro? dre, 


which diverges for U(r) ~ 1/r” with n < 3. Thus the scattering amplitude 
becomes infinite at @ = 0 for fields which decrease not more rapidly than 
1/r. 

Finally, let us consider the case where the phase 6, itself is infinite, as hap- 
pens when U(r) ~ 1/r" with n < 1. Itis evident from the results obtained 
above that, when the field decreases so slowly, both the total cross-section 
and the scattering amplitude for 6 = 0 will be infinite. There remains, 
however, the problem of calculating f (8) for 0 # 0. First of all, we notice 
that the formulat 


>, (2+ 1)Pi(cos 6) = 48(1—cos 8) (124.3) 


l=0 


holds. In other words, the sum vanishes for all 8 + Q. eee in the 
expression (123.11) for the scattering amplitude, we can omit unity in the 
square brackets in each term of the sum when 6 # 0, leaving 


f(® = = > (21-+-1)Pr(cos @)e?#41, (124.4) 
1-0 


If we multiply the right-hand side of the equation by the constant factor 
e-2i8, the cross-section will be unchanged, since it is determined by the 
squared modulus | f (8)|?, while the phase of the complex function f (8) 
is changed only by an unimportant constant. On the other hand, the diver- 
gent integral of U(r) cancels in the difference §,— 5, of expressions such as 


+ This formula is the expansion of the delta function in Legendre polynomials, and can be 


immediately verified by multiplying both sides by sin @ Pi(cos 6) and integrating over 6. 
Here the integral 


jl &(x) dx 
0 


of the even function 8(x) is taken to be $. 
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(124.1), and a finite quantity remains. Thus, to calculate the scattering 
amplitude in the case considered, we can use the formula 


1l m 
f(6) = D (21+ 1)Pi(cos 6)e21-60, (124.5) 
i-0 


§125. The unitarity condition for scattering 


The scattering amplitude in an arbitrary (not necessarily central) field 
satisfies certain relations which follow from general physical requirements. 

The asymptotic form of the wave function at large distances for elastic 
scattering in an arbitrary field is 


1 
p x ekma + — £(nyn’)ettr, (125.1) 
r 


This expression differs from (123.3) in that the scattering amplitude depends 
on the directions of two unit vectors, one (n) in the direction of incidence of 
the particles and the other (n’) along the direction of scattering, and not only 
on the angle between them. 

Any linear combination of functions of the form (125.1) with different 
directions of incidence n also represents a possible scattering process. 
Multiplying the functions (125.1) by an arbitrary coefficient F(m) and inte- 
grating over all directions n (solid angle element do), we can write such a linear 
combination as the integral 


etkr 
| F(n)etk™2' do+ ae f F(n)f(n,n’) do. (125:2) 


Since the distance r is arbitrarily large, the factor e#r™-n’ in the first integral 
is a rapidly oscillating function of the direction of the variable vector n. 
The value of the integral is therefore determined mainly by the regions near 
those values of n for which the exponent has an extremum (n = +n’). In 
each of these regions the factor F(m) ~ F(+n’) can be taken outside the 
integral, and the integration then givest 

eatkr etk 


ikr etkr 
—2riF(n')— + — ll f(n,n')F(n) do. 


27iF(—n’ 
( kr kr 





This expression can be written in a concise operator form, omitting the 
common factor 272/k: 
e—tkr ckr 


F(—n')— —SF(n’), (125.3) 


r r 





t To calculate the integral, we displace the path of integration with respect to the variable 
p = cos 8 (@ being the angle between n and n’) in the complex p-plane in such a way that it 
bends into the upper half-plane, the end points p = +1 being kept fixed. Then the 
function eT decreases rapidly as we move away from these end points. 
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where 


$ = 142ikf (125.4) 
and f is the integral operator defined by 


1 
fP) = — f fln,n’)F(n) do. (125.5) 


The operator S is called the scattering operator, the scattering matrix, or simply 
the S-matrix: it was first used by W. Heisenberg (1943). 

The first term in (125.3) represents a wave going in to the centre, and the 
second a wave going out from the centre. The conservation of the number of 
particles in elastic scattering is expressed by the equality of the total fluxes 
of particles in the ingoing and outgoing waves. In other words, these two 
waves must have the same normalization. To achieve this, the scattering 
operator must be unitary (§12), i.e. we must have 


$$ = 1, (125.6) 


or, substituting (125.4) and carrying out the multiplication, 
f—fr = 2ikff>. (125.7) 


Finally, using the definition (125.5), we can write the unitarity condition for 
scattering in the form 


ik 
flan’) — fn.) = 5 f f(nn’yf*(n’,n’) do”. (125.8) 


For n = n’ the integral on the right-hand side of the equation is just 
the total scattering cross-section o = f | f (n,n”)|2 do”. The difference 
on the left-hand side of the equation reduces in this case to the imaginary 
part of the amplitude f(n). Thus we obtain the following general relation 
between the total elastic scattering cross-section and the imaginary part of 
the amplitude of scattering through an angle zero: 


imf(n,n) = ko/4. (125.9) 


This is called the optical theorem for scattering. 

Another general property of the scattering amplitude can be derived from 
the requirement of symmetry with respect to time reversal. In quantum 
mechanics this symmetry is expressed by the fact that, if a function ys describes 
any possible state, then the complex conjugate function * also corresponds 
to a possible state (§18). Hence the wave function 


elkr e- ikr 
—F*(— n')— —S§*F*(n’), 
7 


r 


which is the complex conjugate of (125.3), also describes some possible 
scattering. process. We define a new arbitrary function by putting 
_ §*F*(n') = (—n’). Using the unitarity of the operator S, we then have 


§125 The unitarity condition for scattering 513 
Fe(n’) = —($*)10(—n’) = —S0(—n)); 


using the operator P of inversion of the coordinates, which changes the sign 
of the vectors n and n’, we can write 


F*(—n') = PF*(n') = —PSParn’). 


Thus we obtain the time-reversed wave function in the form 
e—tkr etkr — a 
@(—n’)— —PSPO(n’). 


rT r 





This must be essentially the same as the original wave function (125.3). 
Comparison shows that this implies the condition 


PSP = $; (125.10) 


then the two functions differ only in the notation for the arbitrary function. 

The corresponding relation for the scattering amplitude is found by 
changing from the operator equation (125.10) to a matrix equation. Trans- 
position interchanges the initial and final vectors n and n’, while inversion 
changes their sign. Hence we have 


S(n,n’) = S(—n’, —n), (125411) 
or, what is the same thing, 


f(n,n’) = f(—n’, —n). (125712) 


This relation (called the reciprocity theorem) expresses the obvious result that 
the amplitudes are the same for two scattering processes such that each is the 
time reversal of the other. Time reversal interchanges the initial and final 
states and reverses the direction of motion of the particles in those states. 

For scattering in a central field, the general relations obtained above can be 
simplified. In this case the amplitude f(n,n’) depends only on the angle @ 
between n and n’. The equation (125.12) therefore becomes an identity. 
The unitary condition (125.8) becomes 


kR 
im f(O) = — f FO" )do", (125.13) 


where y, y' are the angles between n, n’ and some direction n” fixed in space. 
If we use the expansion (123.14) for f (8), the addition theorem (c.10) for 
spherical harmonics gives from (125.13) the following relation for the partial 
amplitudes: 


im fr = RI fil2. (125.14) 
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This formula can also be derived directly from the expression (123.15), 
according to which |2zkf;+ 1|2 = 1. The optical theorem (125.9) is also easily 
deduced directly from formulae (123.11) and (123.12) for the case of scatter- 
ing in a central field. 

Rewriting (125.14) as im(1/f;) = — k, we see that the amplitude f; must 
have the form 


fi = 1/(&— 1k), (125115) 
where g; = gi(k) is a real quantity; it is related to the phase 5; by 


gi = k cot 8. (125.16) 


We shall several times make use of this formula for the amplitude. 

Let us examine (for scattering in a central field) the relation between the 
scattering operator defined above and the quantities which appear in the 
theory given in §123. 

Since the orbital angular momentum is conserved in a central field, the 
scattering Operator commutes with the angular momentum operator. In 
other words, the S matrix is diagonal in the / representation, and since the 
operator $ is unitary its eigenvalues must have unit modulus, i.e. must be of 
the form e?é, with real 5;. It is easy to see that these quantities are the same 
as the phase shifts of the wave functions, so that the eigenvalues of the S- 
matrix are the quantities S; defined in (123.10). The eigenvalues of the 
operator f = (S—1)/2zk are the partial amplitudes (123.15). For, if we take 
Pi(cos 6) as the function F(n) (so that F(—n) = P;(—cos 6) = (—1)!P, 
(cos 8)), the wave function (125.3) must be the solution of Schrédinger’s 
equation represented by a term in the sum (123.9). Thus SP,(cos 8) = 
S iP (cos 6). È 

For a plane wave incident along the z-axis, the function F(n) in (125.3) is 
the delta function F = 45(1—cos 8), where @ is the angle between n and the 
z-axis, the delta function is defined as indicated in the second footnote to §124, 
and the coefficient of it is so chosen as to give simply /(@) on substitution on 
the right-hand side of the definition (125.5); @ is now the angle between n’ 
and the z-axis. Writing the delta function in the form (124.3): 


F = 48(1 — cos 6) = z (21+ 1)P,(cos 6), (125217) 


{=0 


and applying the operator f to it, we find that the scattering amplitude has the 
form (123.14), as it should. 

Finally, we may add the following remark. Mathematically, the unitary 
condition (125.8) signifies that not every specified function f (n,n’) can be the 
scattering amplitude in some field. In particular, not every function f(@) can 
be the scattering amplitude in some central field. From (125.13), a certain 
relation must hold between its real and imaginary parts. If we write 
f(0) = | fet, then, when the modulus |f | is given for all angles, (125.13) 
gives an integral equation from which the unknown phase «(@) can in principle 
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be determined. In other words, from a scattering cross-section (i.e. | f |?) 
known for all angles we can in principle recover the amplitude. This process 
is, however, not completely unique and determines the amplitude only to 
within the alternative 


KO + —F*(), (125.18) 


which leaves the equation (125.13) unchanged (and of course does not alter 
the cross-section | f |*; the transformation (125.18) is equivalent to a simul- 
taneous change ot sign of all the phases 6; in (123.11)). This non-uniqueness 
is, however, removed if the scattering amplitude is regarded as a function of 
energy as well as angle. We shall see below (§§128, 129) that the analytical 
properties of the amplitude as a function of energy are not invariant under 
the transformation (125.18). 


§126. Born’s formula 


The scattering cross-section can be calculated in a general form in a very 
important case, namely that where the scattering field may be regarded as a 
perturbation.t It has been shown in $45 that this is possible when either of 
the two conditions 


|U] < h?/ma? (126.1) 
and 


[U| < hola = (k?ma?)ka (126.2) 


holds, 4 being the range of action of the field U(r) and U the order of magni- 
tude of the field in the range where it is significant. When the first condition 
is satisfied, the approximation is valid for all velocities; the second condition 
shows that it is always applicable for sufficiently fast particles. 

In accordance with §45, we seek the wave function in the form Ņ = 
POHD, where y® = e™ corresponds to an incident particle having 
wave vector k=p/k. From formula (45.3) we then have 


yi ) at f U( oe A €(k.r’+kR) oy 126 3) 
x,y,z) = --—— X Y Z jerls —. ; 
k 2r? f R i 
Taking the origin at the scattering centre, we introduce the radius vector 
R, from the origin to the point where the value of 4) is required, and 
denote by n’ a unit vector along Ry. Let the radius vector of a volume 


element dV’ be r’; then R = Rọ—r'. At large distances from the centre, 
Ry > 7’, so that 


pa IRo- r'| ~ Ro— xr’ pati: 


Substituting this in (126.3), we have the following asymptotic expression for 


+ In the general theory derived in §123 this approximation corresponds to the case where all 
the phases 8: are small ; it is also necessary that these phases can be calculated from Schrédin- 
ger’s equation with the potential energy regarded as a perturbation (see Problem 4). 
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pO): 


eikR, 





| U(r’ etek” AV" 


(where k’ = kn’ is the wave vector of the particle after scattering). Compar- 
ing this with the scattering amplitude given by formula (123.3), we find for 
the latter the expression 


m 
feos f Ue-tar dV, (126.4) 


where we have renamed the variable of integration and introduced the vector 
q =k’—-k, (126.5) 


whose absolute magnitude is 
q = 2k sin36, (126.6) 


6 being the angle between k and k’, i.e. the scattering angle. 

Finally, squaring the modulus of the scattering amplitude, we have the 
following expression for the cross-section for scattering into the solid angle 
element do: 


2 
do. (126.7) 


m2 


O = 
4n2h4 








{ Uetard V 





We see that the scattering with a momentum change fiq is determined by the 
squared modulus of the corresponding Fourier component of the field U. 
Formula (126.7) was first obtained by M. Born (1926). In the theory of 
collisions, the approximation considered here is often called the Born approxi- 
mation. 

It may be noted that, in this approximation, the relation 


fikik')=f*(k',k) (126.8) 


holds between the amplitudes of the direct and inverse scattering processes, 
i.e. processes differing by the interchange of the initial and final momenta, 
without the change of sign such as occurs in time reversal. Thus another 
symmetry property, in addition to the reciprocity theorem (125.12), appears 
in scattering. This property is closely reiated to the smallness of the scatter- 
ing amplitudes in perturbation theory, and follows immediately from the 
unitarity condition (125.8) if we neglect the integral term quadratic in f-t 

Formula (126.7) can also be obtained by another method (which, however, 
does not determine the phase of the scattering amplitude). We can start 
from the general formula (43.1), according to which the transition probability 


+ Hence it.is clear that this property no longer holds ın even the second approximation of 
perturbation theory. This will be proved directly in §130; cf. (130.13). 
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between states of the continuous spectrum is given by the expression 
dw i = (2r /h)| Upil? d(Ey— E;)dv. 


In the case under consideration, we have to apply this formula to a transition 
from the state of the incident particle with momentum p to the state of the 
particle, with momentum p’, scattered into the element of solid angle do’. 
As the interval of states dv, we can take d%p’/(27h)8. Substituting for the 
difference of the final and initial energies 


E;— E; = (p'*—p®)/2m, 


we obtain 


dwy, = (47m/h)| U ppl28(p'?— p?) dp’ (27h). (126.9) 


The wave functions of the incident and scattered particles are plane waves. 
Since we have taken as the interval of states dvp an element of p/27h space, the 
final wave function must be normalized by the delta function of p/2ai: 


Yp = di'r, (126.10) 


We norm-uze the initial wave function to unit current density: 


Yp = Vim[p)es Mo, (126.11) 


Then (126.9) will have the dimensions of area, and is the differential scattering 
cross-section. 

The presence of the delta function in formula (126.9) means that p’ = p, 
i.e. the absolute magnitude of the momentum is unchanged, as it should 
be in elastic scattering. We can remove the delta function by changing to 
spherical coordinates in momentum space (i.e. by replacing d%p’ by p'?dp' 
do’ = hp'd(p’2)de|) and integrating over p'?. The integration amounts to 
replacing p’ by p in the integrand, and we obtain 


do = (mp/4n?h4)| [Y¥p"UypdV Pde’. 


Substituting the functions (126.10), (126.11), we reach once more the final 
expression (126.7). 

In the form (126.7), this formula is applicable to scattering in a field 
U(x, y, z) which is any function of the coordinates, and not only a function of 
r. In the case of a central field U(r), however, this formula can be further 


transformed. In the integral 


j U(rje—ta.r d V 


we use spherical space coordinates r, $, ¢, with the polar axis in the direction 
of the vector q, denoting the polar angle by $ to distinguish it from the 
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scattering angle 6. The integration over $ and¢ can be effected, and we obtain 





O2n n Leo) 
sin gr 
ff U(r)etar 003 872 sind dddddr = 4rr f U(r) r dr. 
q 
000 0 


Substituting this expression in (126.4), we obtain the following formula 
for the scattering amplitude in a centrally symmetric field: 





r dr. (126.12) 


For 6 = 0 (i.e. q = 0), the integral diverges when U(r) decreases at infinity 
not more rapidly than 1/r3 (in acordance with the general results of §124). 

We may call attention to the following interesting fact. The momentum p 
of the particle and the scattering angle @ enter (126.12) only through q. Thus, 
in the Born approximation, the scattering cross-section depends on p and @ 
only in the combination p sin 38. 

Returning to the case of arbitrary fields U(x, y, z), let us consider the 
limiting cases of small velocities (ka < 1) and large velocities (ka > 1). For 
small velocities, we can put e~‘* ~ 1 in (126.4), so that 


m 


a Fae U dV, (126.13) 


while if U = U(r), 


2m r : 
f=-= f Ulryr? dr. (126.14) 
0 


Here the scattering is isotropic and independent of the velocity, in accord- 
ance with the general results of §132. 

In the opposite limiting case of high velocities, the scattering 1s markedly 
anisotropic and mainly forward in a narrow cone of angle A8 ~ 1/ka; since 
outside this cone the quantity q is large, the factor e~'4¥ is a rapidly oscillating 
function, and the integral of its product with the slowly varying function 
U is almost zero. 

The law of decrease for large q is not universal and depends on the specific 
form of the field. If the field U(r) has a singularity at r = 0 or at any 
other real value of 7, the integral (126.12) is mainly determined by the range 
near the singular point, and the cross-section decreases according to a power 
law. The same applies to the case where the function U(r) has no singularity 
but is not an even function; here the region near r = 0 is the most important 
in the integral. If U(r) is an even function of r, however, the integration may 
be formally extended to negative values of r, 1.€. taken along the whole of the 
real axis of the variable r, after which (if U(r) has no singularity on the real 
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axis) the path of integration may be moved into the complex plane until it 
meets the nearest complex singularity. Then, for large q, the integral will 
decrease exponentially. It should be borne in mind, however, that the Born 
approximation is in general inadequate to calculate this exponentially small 
quantity (see also §131). 

Although the value of the differential scattering cross-section within the 
cone A0 ~ 1/ka does not depend greatly on the velocity, the total scattering 
cross-section (assuming that the integral f do does converge) decreases at high 
energies owing to the decreasing angle of the cone, in proportion to the solid 
angle of the cone, i.e. as (å)? ~ 1/k2a?, or inversely as the energy. 

In many physical applications of collision theory the quantity which 
describes the scattering is the integral 


Ctr = Í (1—cos 6) do, (126.15) 
often called the transport cross-section. Arguments similar to those given above 


show that at high velocities this quantity is inversely proportional to the square 
of the energy. 


PROBLEMS 
PROBLEM 1. Determine, in the Born approximation, the scattering cross-section for a 
spherical potential well: U = ~U, forr < a, U = Oforr > a. 


SoLuTion. The calculation of the integral in (126.12) gives 





j pom ( mUoa? y (sin ga—ga cosga)? 
o = 4a ———_ 


he (ga)é 


The integration over all angles (which is conveniently effected by using the variable 
q = 2k sin 40 and replacing do by 27g dg/k?) gives the total scattering cross-section 


fej 


27 {mUoa* \* i 1 sin4ka sin? | 
= =| ——.... — — 4 e c e ié'=t=8 
k? ( he ) [ (2ka)? (2ka)®  (2ka)î 


In the limiting cases this formula gives 











l6ra? / mUa? \ 2 
ae ( ) for ka <1, 
9 h? 
2r fmUpa* \ 2 
o= =( ) for ka > 1. 
R2\ k 


PROBLEM 2. The same as Problem 1, but in a field U = Upe-r?/a", 


SOLUTION. The calculation is conveniently effected from formula (126.7), taking the 
direction of q along one of the coordinate axes, The result is 





mUpa? \2 
do = trae( = ) E9782 do, 
and the total cross-section is 
m2 smUpa2\2 
= Fe ( pe) ey 
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The condition for these formulae to be applicable is given by the inequalities (126.1), (126.2) 
with U» in place of U. The formula for do is also inapplicable if the exponent ts large in 
absolute magnitude.t 


PROBLEM 3. The same as Problem 1, but in a field U = (a/r)e-*/a, 
SoLuTion. The calculation of the integral in (126.12) gives 





ama \2 do 
do = sat ) —_—__—_., 
he | (g2a2+1)2 
The total cross-section is 
ama \2 1 
za 16ra(—— es, 
A? J 4k2a2+) 


The condition for these formulae to be applicable is found from (126.1) and (126.2) with 
aja instead of U: ama/h? <1 ora/hv <1. 


PROBLEM 4. Determine the phases ô; for scattering in a centrally symmetric field for the 
case corresponding to the Born approximation. 


SOLUTION. For the radial wave function x = rR for motion in the field U(r), and for the 
function x for free motion, we have the equations (see (32.10)) 


fe Kl+1) 2m 
a har Uno 


(i41) 
0” pc (0) — 
xO" + E : fk 0. 


Multiplying the first equation by x!°), the second by x, and subtracting, followed by integra- 
tion with respect to r (using the boundary condition x = 0 atr = 0), we obtain 








7 f 
(xr) ) = -- Í Up dy, 
0 


\ 


Regarding U as a perturbation, we can put xx on the right-hand side. For ro the 
asymptotic expressions (33.12), (33.20) can be used on the left-hand side, while in the integral 
we substitute the exact expression (33.10). The result is 


sind; vd; = — z | U(r)[ Jiexy2(kr) }*r dr. 
Le) 


This formula could also be derived by a direct expansion of the Born scattering amplitude 
(126.4) in Legendre polynomials in accordance with (123.11) (for small 6)). 


PROBLEM 5. Determine in the Born approximation the total scattering cross-section in a 
field U = a/(r?+.a*)"’? with n> 2, for fast particles (ka> 1). 


SOLUTION. We shall see that in this case the partial amplitudes with large angular 
momenta / are predominant in this scattering. The cross-section may therefore be calculated 
from formula (123.11). replacing the summation over / by integration; in the Born approxi- 
mation, all 6;<1, so that 


eon =| 218,2 dl. (1) 
k2 
0 


+ The inapplicability of perturbation theory in this case 1s easily seen by calculating the 
scattering amplitude in the second approximation (see (130.13)); although the coefficient of the 
exponential is small in comparison with the coefficient in the first-approximation term, the 
magnitude of the negative exponent is only half as great. 
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The phases 6; with large / are calculated from (124.1): 


iv @] 

open dr 

: he | (r24a2)" k Rra | 
Lk 


By the substitution r? +a? = (a?+/*/k*)/é, the integral is brought to the familiar Euler form, 
and the result is 
mak” -2 P(3)P(in— 3) 


Stes en arene ees 
i 2h2(a2k2 + 12)(n-1)/2 T(4n) 


(2) 


The integral (1) is determined by the range /~ ak> 1, and this justifies the assumption made. 
A calculation of the integral gives the result 


zar g a 








Tin) | \khan-? (3) 


n—2L [(4n) 


According: to (126.2), the condition for the Born approximation to be valid in this case is 
mal/h*ka"-1<€1. Note the dependence o~k~?, which is in accordance with the general 
statement made above. 


PROBLEM 6. Determine in the Born approximation the scattering amplitude in the two- 
dimensional case of a field U = U(x, z) with the particle flux incident along the z-axis. 


SoLuTion. Using the second footnote to §45 and the kno’ ʻi asymptotic expression of the 
Hankel function 


Ho (u) = = elit-n/4) as u —> o, 
TU 


we find for the correction to the wave function at large distances R, from the field axis (the 
y-axis) the expression 

y oa SP) ar 

ve 


0 


where the scattering amplitude is 


m 


SA in j4 Q- 2 
f$) = RJ (2nky [vox a-F dp, 


with p = (x,z) the two-dimensional radius vector, d’p = dxdz, and @ the scattering angle in the 
xz-plane. In the two-dimensional case, the scattering amplitude has the dimensions of square root 
of length, and the scattering cross-section da = | f|? do those of length. 


§127. The quasi-classical case 


Let us investigate the manner in which the passage occurs from the 
quantum-mechanical theory of scattering to the limit of the classical theory. 
Omitting from consideration a scattering angle 6 of zero, we can write 


the scattering amplitude given by the exact theory in the form (124.4): 
F£(8) = (1/2ik) p (21+1} Pi(cos O)e2#1, (127.1) 


We know that the quasi-classical wave functions are characterized by having 
large phases. It is therefore natural to suppose that large phases 8, 
correspond to the passage to the limit of the classical theory of scattering. 
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The value of the sum (127.1) ts mainly determined by the terms with large /. 
Hence we can replace P, (cos 8) by the asymptotic expression (49.7), which 
we write in the form 


Py(cos@) x [efll+1/2)6-+10/4__ 941/216 -t77/4), 


~-4/(2nl sin 8) 


Substituting this expression in (127.1), we obtain 


2r sinf 


f(8) = : > J | galaan efl28,H(141/200+7 / 41) 
k 
' (127.2) 


The ezponential factors, regarded as functions of J, are rapidly oscillating 
functions, since their phases are large. The majority of the terms in the 
sum in (127.2) therefore cancel. The sum is mainly determined by the range 
of values of J near that for which one of the exponents has an extremum, 
i.e. near the root of the equation 


2 d&/d/-L6 = 0. (127.3) 


In this region there are a large number of terms in the series for which the 
exponential factors have almost the same value (since the exponents vary 
slowly near the extremum), and which therefore will not cancel. 

The phases ô; in the quasi-classical case can be written (see §124) as the 
limit to which the difference between the phase 


1 r 
grt | VimE-UO -EUD dr 


of the quasi-classical wave function in the field U(r) and the phase 
kr—}lr 
(see §33) of the wave function of free motion tends as r > œ. Thus 


[e a 


1 
8) = f (v I2mE— U=)? dr+łn(l+})— kro. (127.4) 


Ta 


This expression is to be substituted in equation (127.3). In finding the 
derivative of the integral, it must be remembered that the limit of integration 


7o also depends on /; the term & dr,/di arising trom this, however, cancels 
with the derivative of the term —A&r, in ô, 

fi(1+44) is the angular momentum of the particle. In classical mechanics, 
it can be written in the form mpv, where p is the impact parameter, and v 
is the velocity of the particle at infinity. We make this substitution; equation 
(127.3) then takes the final form 
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f ee! ; 127.5 
j AmE- Uim TO ee) 
Ta 

In a repulsive field this equation has a root (for p) only for a minus sign in 

front of @ on the right-hand side, and in an attractive field only for a plus 

sign. 

Equation (127.5) is exactly the same as the classical equation which deter- 
mines the scattering angle from the impact parameter (see Mechanics, §18). 
It is easy to see that the classical expression for the cross-section is in fact 
obtained. 

To prove this, we expand the exponent in (127.2) in powers of l’ = 1—[9(6), 
where Jo(@) is determined by equations (127.3)-(127.5). We shall take in 
particular the first term in (127.2), and accordingly the lower sign in (127.3) 
(repulsion). According to (127.3), 


[d28;/d/2]7~7, = $d6/dlo, 
and so 


i[28; — (L+ $)6 — da] &7[28), — (lo + $)8 — 4r] + 32(d 6/dlo)l’. 


The summation over / in (127.2) is now replaced by integration over l’ near 
the point l = 0. Regarding l’ as a complex variable, we take the path of 
integration near this point in the direction of steepest decrease of the ex- 
ponent, i.e. at an angle of 47 or —47 to the real axis, according to the sign of 
d6/dlo. In other words, we put l' = £ exp( + łi7) and integrate over real 
values of £; owing to the rapid convergence of the integral, it can be extended 
from — œ to œ: 


f exp( —4£2|d6/dlo])dé = (2n|dlo/d6])*/2. 


-® 


The result is 





ey cle 
£(%) = 5 ( 


1/2 
=a saa) exp {7[26, —(/o+4)@—in]}. (127.6) 





Hence 


do 


| f |? . 27 sin 6 dé 
2m(lo/k?)| dlo/d 6] da, (127.7) 


and with the impact parameter p = /o/k we arrive at the classical formula 
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do = 2p dp. 

Thus the conditions for classical scattering through a given angle @ are 
that the value of Z for which (127.3) holds should be large, and that 8; should 
also be large for this value of J.t This latter condition has a simple inter- 
pretation. If we can speak of classical scattering through an angle 6 when 
the particle is incident at an impact parameter p, it is necessary that the 
quantum-mechanical indeterminacies of these two quantities should be 
relatively small: Ap < p, A0 < 6. The indeterminacy in the scattering 
angle is of the order of magnitude A@ ~ Ap/p, where p is the momentum of 
the particle and Ap is the indeterminacy in its transverse component. Since 
Ap ~h/Ap > hlp, we have A0 > h/pp, and thus 


8 > hlpmv. (127.8) 


Replacing the angular momentum mpv by hil, we obtain 6] > 1, which is 
the same as ô; > 1 (since 6; ~ l0, as we see from (127.3)). 

The classical angle of deviation of the particle can be estimated as the 
ratio of the transverse momentum increment Ap during the “‘collision time” 
T ~ p/v and the original momentum mv. The force acting on the particle at a 
distance pis F = —dU(p)/dp; hence Ap ~ Fp/v, so that 6 ~ Fp/mv?. This 
estimate is strictly valid only if @ < 1, but it can be applied to give an order 
of magnitude even if 0 ~ 1. Substitution in (127.8) gives the condition for 
quasi-classical scattering in the form 


|F| p? > ho. (127.9) 


This inequality must hold for all values of p such that |U(p)| < £. 

If the field U(r) decreases more rapidly than 1/r, the condition (127.9) 
always ceases to be satisfied for sufficiently large p. Small 6, however, 
correspond to large p; thus scattering through sufficiently small angles is 
never classical. If, on the other hand, the field decreases less rapidly than 
l/r, the scattering through small angles is classical; whether the scattering 
through large angles is classical in this case depends on the behaviour of the 
field at small distances. 

For a Coulomb field, U = ajr, the condition (127.9) is satisfied if a > Av. 
This is the opposite condition to that for which the Coulomb field can be 
regarded as a perturbation. We shall see, however, that the quantum theory of 
scattering in a Coulomb field leads to a result which, as it happens, is always 
in agreement with the classical result. 


+ The relation between 8 and p given by (127.5) may not be one-to-one, more than one 
value of p may correspond to the same value of 6. In such a case, the amplitude f (8) is given 
by the sum of (127.6) with the appropriate values of /,. At extrema of @(p) the derivative 
dp/d@ and therefore the classical differential cross-section do/do become infinite; near such 
an angle, the classical approximation is of course invalid (see Problem 2). 
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PROBLEMS 


PROBLEM 1. Find ‘the total cross-section for quasi-classical scattering in a field which has 
the form U = gajr” (n> 2) at sufficiently large distances. 


SoOLuTion. Bearing in mind that the phases ô with large l are the most important, we 
calculate them from (124.1): 


(a) 


; mx dr 
mo Í rm4/(k2— [2)r2) 
i/k 


__makn-2 TG) Gn) 


= 1 
Zhe In -1 (47) ii 


see §126, Problem 5, for the calculation of the integral. 
Replacing the summation in (123.12) by an integration, we write 


wo 
4n 
c=— f 2l sin?5, dl. 
k2 
0 


We substitute 6; = u and integrate by parts with respect to x, reducing the integral to a 
gamma function. The result is 


a = Jonin-iain [+ EFE Sy" -D ( : ji -1) 5 


For n = 3, the indeterminacy can be resolved to give o = 2r?alhħv. 
The chief condition for the applicability of this result is that / > 1 for & ~ 1; this gives 
the inequality 








mxk"-2/h2 > L. 


A further condition arises from the requirement that the field U(r) should have the form in 
question from distances 


r~ljk~(ma!ħ?k) uin- 


outwards (/ being obtained from &; ~ 1), these distances playing the principal part in the 
integral (1). If this form is reached only at distances r > a, where a represents the charac- 
teristic dimensions of the field, we have the condition 


malh?kan-1> 1, 
which places an upper limit on the permissible velocities. In this case, for sufficiently high 
velocities (ma/h*ka"- € 1), o ~ k-? (cf. §126, Problem 5). 


PROBLEM 2. Find the angular distribution of scattering near an extremum of the classical 
scattering angle 8(p) as a function of the impact parameter p = //k. 


SOLUTION. The presence of an extremum of &(/) for some / = l, implies, according to 
(127.3), that the phase &; near this point has the form 


26) = 26), + Gol’ + Lal's, 


where 8, = Glo), L = = la; here we again take the Particular case of the lower sign in (127.3). 
The constant @ is negative and positive respectively for a maximum or minimum of the 
function 6(/). For the Scattering amplitude we have instead of (127.6) 


l l u2 f 
8 = = . 2 Oe fs pg s ‘ 
| f (8y AE Ta = [exp UC — 28" + kal) dl’, 
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where 6 = @—6,. Expressing the integral in terms of the Airy function by means of (b.3), 
we finally have for the scattering cross-sectionT 


ms da, 2 p 
do = TTT (-5) dé 


The differential cross-section do/d@ decreases with increasing distance into the classically 
inaccessible region of scattering (6 >0 for x<0, or 6’ <0 for x>0); on the other side of the 
point 6’>0, it oScillates between zero and a gradually decreasing amplitude. Its maximum 
value occurs for 6’~~2'3 = 1-02, where ©? = 0-90. 








PROBLEM 3. Find the angular distribution of quasi-classical scattering at smali angles, if 
the classical angle of deviation @ is zero for some finite value of p = l/k- 


SOLUTION. The assumption of quasi-classical scattering means here that h > 1 and 
51, < 1. Then the values of / close to /, are important in scattering. For small l’ = 1—dy we 
have 


8, = 8), + Spl’2; 


then, according to (127.3), 8 = 0 for l’ = 0. This expression is to be substituted in (127.1), 
and P:(cos 6) can be written in the form (49.6). The summation over / is again replaced by 
integration over l’ near the point l’ = 0: 


f = (lotih) exp (2181) | Jo(l6) exp (igl?) dl’ 


The integral is determined by the range l’ ~ 8-12. For angles 8 < yẹ, we can take the 
function J,(/@) outside the integral and use its value at ? = l. The remaining integral is 
calculated as shown above. The result for the cross-section is |) 


do = (mlo?/Bk*) Jo2(lo8) do. 


A similar result is found for the cross-section for scattering at angles close to v if the clas- 
sical scattering angle tends to 7 for some finite (non-zero) value of p. 


§128. Analytical properties of the scattering amplitude 


A number of important properties of the scattering amplitude can be 
established by considering it as a function of the energy E of the particle 
undergoing scattering, this energy being formally regarded as a complex 
variable. 

Let us consider the motion of a particle in a field U(r) which vanishes 


sufficiently rapidly at infinity (the necessary degree of rapidity will be specified 
later). To simplify the discussion we shall first suppose that the orbital 
angular momentum / of the particle is zero. We can write down the asymp- 
totic form of the wave function (the solution of Schrédinger’s equation with 


l = 0 for any given value of E) as 


+ This type of scattering occurs in the theory of the rainbow, and it is therefore called 
rainbow scattering. — 

{ Strictly speaking, the amplitude should include a term representing the contribution to 
small-angle scattering from impact parameters p >œ. This contribution, however, 1S 1n 
general small in comparison with that shown 

' This type of scattering ts called Zuminescence, from its occurrence in the theory of certain 
meteorological.phenomena. 
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x = n} = AE) exp(— mr) + B(E) ap E, (128.1) 


and regard E as a complex variable, defining 4/— E as being positive when E is 
real and negative. The wave function is assumed normalized by some definite 
condition, say (0) = 1. 

On the left half of the real axis (E < 0) the exponential factors in the two 
terms in (128.1) are real; one decreases and the other increases as r > œ. 
From the condition that y is real it follows that the functions A(E) and B(E) 
are real for E < 0, and from this in turn it follows that these functions have 
complex conjugate values at any two points lying symmetrically about the 
real axis: 


A(E*) = A*(E),  B(E*) = B*E). (128.2) 


On going from the left half to the right half of the real axis through the 
upper half-plane we obtain an asymptotic expression for the wave function 
for E > Q0 imn the form 


y = A(Eetr-+ B(E)etir, k = 4/(2mE) fh. (128.3) 
If a path through the lower half-plane is used, however, the result is 
x = AX*(E jek + B¥(E)et*r, 
Since x must be a single-valued function of E, this means that 


A(E)= BYE) for E> 0; (128.4) 


this relation also follows directly from the fact that y is real for E > 0. 
Nevertheless, because the root 4/— E in (128.1) is not single-valued, the co- 
efficients A(E) and B(E) themselves are not single-valued. To avoid this, we 
cut the complex plane along the right half of the real axis. The cut makes 
4/—£ single-valued, and so the functions A(E) and B(E) are uniquely 
determined. They have complex conjugate values on the upper and lower 
edges of the cut (in (128.3), A(E) and B(E) are taken on the upper edge). 

The complex plane cut in the manner described above will be called a 
physical sheet of the Riemann surface. According to our definition we have 
everywhere on this sheet 


re /—E > 0. (128.5) 
In particular, on the upper edge of the cut 4/-—£ thus defined becomes 


—iy/ E} 


+ Inthe rest of this section we shall be considering the properties of the scatrering amplitude 
on the physical sheet. Later, however, it will sometimes be necessary to consider another 
non-physical” sheet of the Riemann surface (see §134). On this sheet 


re y—E <0. (128.5a) 


The passage from the right half of the axis to the non-physical sheet is made directly down 
through the cut. : 
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In (128.3) the factors e* and e-‘*7, and so also the two terms in y, are 
quantities of the same order of magnitude; an asymptotic expression of the 
form (128.3) is therefore always legitimate. Everywhere else on the physical 
sheet the first term in (128.1) decreases exponentially, and the second term 
increases exponentially, as r > œ (because of (128.5)). Hence the two terms 
in (128.1) are of different orders of magnitude, and this expression may not 
be legitimate as the asymptotic form of the wave funstion: the small term 
compared with the large one may represent an unjustified exaggeration of 
accuracy. For the expression (128.1) to be legitimate the ratio of the small 
and large terms must not be less than the relative order of magnitude of the 
potential energy U/E, which is neglected in Schrédinger’s equation on going 
to the asymptotic region. In other words, the field U(r) must be such that 


2/(2m) 
hi 





U(r) decreases more rapidly than exp( — rre+/— E) asr —> co. (128.6) 


When this condition is satisfied, for any re ./(— E) > 0, that is, if U (r) 
decreases more rapidly than 


e`" (128.6a) 


with any positive constant c, the asymptotic expression of the form (128.1) 
is valid everywhere on the physical sheet. Being a solution of an equation 
with finite coefficients, it has no singularity with respect to £. This means 
that the functions A |) and B(E) are regular everywhere on the physical 
sheet except the point E = 0, which, being the point where the cut begins, 
is a branch point of these functions. 

The bound states of a particle in the field U(r) correspond to wave functions 
which vanish as r > co. This means that the second term in (128.1) cannot 
appear, i.e. the discrete energy levels correspond to zeros of the function 
B(E). Since Schrédinger’s equation has only real eigenvalues, all the 
zeros of B(E) on the physical sheet are real (and lie on the left half of the real 
axis). 

The functions 4(Z) and B(E) for E > 0 are directly related to the scattering 
amplitude in the field U(r): comparing (128.3) with the asymptotic expression 
for x written in the form (33.20), 


y = constant x [et(kr+8o — e-f(kr+40)], (128.7) 


we see that 
— A(E){/B(E) = e2#6elE), (128.8) 


The scattering amplitude with angular momentum / = 0 is, according to 


(123.15), 


1NR" N (128.9) 


] 
& E pag -5 
fo= 53° )= (a amE)\ B 
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here A and B are taken on the upper edge of the cut. 

Considering now the Scattering amplitude as a function of E over the whole 
physical sheet, we see that the discrete energy levels are simple poles of this 
function. If the field U(r} satisfies the condition (128.6a), the above dis- 
cussion shows that the scattering amplitude has no other singular points. t 


discrete level E = Eo < 0. To do SO, we write down the equations satisfied 





„ 2m E-U) 0 cy \" 2m E-U) ey 2m 
+ T = y r J- re — == — —— æ 
N (=) pe gen ag 


Multiplying the first by 6y/éE, the second by x, subtracting, and integrating 
with respect to r, we obtain 


ôy êĉx\ mp , 
— —y(—— } = — dr. 128.10 
ae tee) we) i ) 


We apply this relation for E = Eo and r > œ. The integral on the right- 
hand side becomes unity for r —> œ if the wave function of the bound state is 
normalized by the usual condition | x? d= 1."On the left-hand side we 
substitute y from ( 128.1), using the fact that, near the point E = Ep, 


A(E) = A(Eo) = Ao,  B(E) = (£+|£o))[dB/dE]e-£, = S(E+|Eo)). 


The result is 
l m 


—— = eee 


AohN 2| Eo} 


By means of these expressions we find that, near the point E = Ep, the prin- 
cipal term in the scattering amplitude (i.e. the amplitude for / = 0) has the 
form 

h?Ao 1 


2m E-+|Eo| 


— oa 


(128.11) 


Thus the residue of the Scattering amplitude at the discrete level js deter- 
mined by the coefficient Ao in the asymptotic expression 


eA, exp( a, (128.12) 


of the normalized wave function of the corresponding stationary state. 
Returning to the examination of the analytical Properties of the scattering 
amplitude, let us consider cases where the condition (128.6) is not satisfied. 


eer, 


t Except the point E = Q, which 18 singular, because of the singularity of A(E) and B(E) 
previously mentioned, The Scattering amplitude, however, remains finite as E 0 (see §132) 
In future we shall, for brevity, omit this qualification. 1 
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In such fields only the increasing term in (128.1) is the correct part of the 
asymptotic form of the solution of Schrédinger’s equation over the whole 
of the physical sheet. Accordingly, we can, as before, assert that the function 
B(E) has no singularity. 

The function A(£) under these conditions can be determined in the complex 
plane only as an analytical continuation of the function which is the coefficient 
in the asymptotic expression for y on the right half of the real axis, where the 
two terms in y are both legitimate. In general, however, such a continuation 
now gives different results according as it is carried out from the upper or the 
lower side of the cut. To obtain a single-valued function, we shall agree to 
define A(E) in the upper and lower half-planes as the analytical continuation 
from the upper and lower sides of the right half of the real axis respectively; 
the cut must then in general be extended to the whole of the real axis. The 
function thus defined has as before the property A(E*) = A*(E), but in 
general is not real either on the right or on the left half of the real axis. It 
may also, in principle, possess singularities. 

We shall show, however, that there is nevertheless a class of fields for 
which the function A(E) has no singularity on the physical sheet, although 
the condition (128.6) is not satisfied. 

To do so, we regard y as a function of a complex variable r for a given 
(complex) value of E. Here we need only consider values of E in the upper 
half-plane, since the values of the function A(E£) in the two half-planes are 
complex conjugates. For values of r such that Er? is real and positive, the 
two terms in the wave function (128.1) are of the same order, i.e. we return 
to the situation which occurs when E > O andr is real, when both terms in the 
asymptotic expression for x are legitimate for any field U(r) which tends to 
zero at infinity. We can therefore say that A(E) cannot have singular points 
for values of E such that U(r) > 0 whenr -> œ along a line on which Er? > 0). ' 
When E takes all values in the upper half-plane, the condition Er? > 0 
selects the lower right quadrant in the complex r-plane. 'Fhus we conclude 
that A(E) also has no singularity on the physical sheet when U(r) satisfies the 
conditiont 


U(r) +0 when r > œ in the right half-plane (128.13) 


(L. D. Landau 1961). 

The conditions (128.6) and (128.13) cover a very wide class of fields. We 
can therefore say that the scattering amplitude usually has no singularity in 
the two half-planes. On the left half of the axis (which is part of the physical 
sheet if not cut) the scattering amplitude has poles corresponding to the 
energies of the bound states; when the cut exists, there may be other singu- 
larities also. 


This happens, in particular, in fields of the form 


+ Since U(r) is real on the real axis U(r*) = U*(r); thus the condition (128.13) is satished 
throughout the right half-plane if it is satished in the lower right quadrant. 
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U = constant x r%e~"/a (128.14) 


with any n. On the segment 0 < —E < A2/8ma? of the left half of the axis, 
the condition (128.6) holds, and so there need not be a cut; the scattering 
amplitude has only poles corresponding to the bound states. On the remainder 
of the left half of the axis there may be redundant poles and other singu- 
larities (S. T. Ma 1946). The appearance of these is due to the fact that the 
function (128.14) no longer tends to zero when r > œ along a line on which 
Er? > 0, as soon as E moves below the left half of the axis (i.e. this line falls 
to the left of the imaginary axis in the complex r-plane). 

Next, let us consider the analytical properties of the scattering amplitude 
as |E| > œ. When E > + œ along the real axis, the Born approximation is 
valid and the scattering amplitude tends to zero. According to the above 
discussion, this situation also occurs wheh E tends to infinity in the complex 
plane along any line arg E = constant, if we consider complex values of 7 
for which Er? > 0. If U +0 when r > œ along a line argr = —}arg E, 
and U(r) has no singular point on this line, then the condition for the Born 
approximation to be valid is satisfied and the scattering amplitude again tends 
to zero. When arg E takes all values from 0 to ~, argr takes values from 
Oto — $r. 

We therefore conclude that the scattering amplitude tends to zero at 
infinity in all directions in the E-plane if the function U(r) has no singular 
point in the right half-plane of r and tends to zero at infinity. 

Although we have have spoken throughout of scattering with angular 
momentum / = 0, all the above results are in fact valid for the partial scatter- 
ing amplitudes with any non-zero angular momentum. The only difference 
in the derivations is that, instead of the factors etik" in the asymptotic expres- 
sions for y, we should have to use the exact radial wave functions for free 
motion (33.16). 

Some changes are needed in formulae (128.9) and (128.11) when J # 0. 
Instead of (128.7) we now have 


x1 = rR; = constant x {exp[i(kr — hla + ô1)] — exp[—i(Ar —hlr +8) ]}, (128.15) 


and for the partial amplitude f; (defined according to (123.15)) we obtain 


A 
Ji C+), (128.16) 


ern 


The principal term in the Scattering amplitude near the level E = Eo with 
angular momentum / is given by the formula 





t The limiting form (33.17) of these functions can be used only for E>0; in the rest of the 
E-plane, where the two terms in y are of different orders of magnitude, the use of these 
limiting expressions would involve an error in y which is in general greater than that which 
arises from neglecting U in Schrödinger’s equation. 
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f œ (214+1)fiPi(cos 8) 


h2Ap® 1 
———(2I+ 1)P,(cos 8) (128.17) 


= (—1)#1 
2m E-+|Bo| 





instead of (128.11). 


§129. The dispersion relation 


In the previous section we have studied the analytical properties of the 
partial scattering amplitudes with given values of /, and have seen that these 
properties are complicated by the possible appearance of ‘“‘redundant”’ 
singularities and non-regularity at infinity. The total amplitude, regarded as 
a function of energy for given values of the scattering angle, evidently has 
similar properties. The scattering amplitude for scattering angle zero forms 
an exception, however: as we shall now show, its analytical properties are 
considerably simpler. 

Writing Schrédinger’s equation for the wave function of the particle 
undergoing scattering as 


Apt Rep = (2mU/ hyp, (129.1) 


we may formally regard it as a wave equation with a non-zero right-hand side, 

i.e. as the equation of retarded potentials well known in electrodynamics. 
The solution of this equation which describes the “‘emission’’ in some 

direction k’ at large distances Ro from the centre has the torm (see Fields, 


§66) 





ge Sa EE ie ay 129.2 
= — D sw ee — »F 
Use Ro { p2 ibe dV. ( : ) 


In the present case this represents the wave function of the scattered particle, 
and the coefficient of e‘kRo/Ro gives the scattering amplitude /(6,£). In 
particular, putting k’ = k (where k is the wave vector of the incident particle), 
we obtain the scattering amplitude for scattering angle zero: 


 . le pntke 129.3 
JOE) = - =" | Ube av (129.3) 


(the z-axis being taken in the direction of k). This expression has, of course, 
only formal significance, since the integrand again involves the unknown wave 
function. However, it allows certain conclusions to be drawn concerning the 
analytical properties of the quantity f(0,E) as a function of the energy. E.t 

The function 4 in the integrand consists of two parts when r is large, the 


+ It is assumed, of course, that the field U(r) decreases, as r >æ, sufficiently rapidly for 
f(O,E) to exist (when £>0); see §124 
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incident wave and the outgoing wave. The latter is proportional to er, so 
that the corresponding part of the integral contains e*('-2) in the integrand. 
On the other hand, in the complex plane (going from the upper edge of the cut 
along the right half of the real axis) ik is replaced by —4/(-—2mE)/A, and 
re,/—E > QO everywhere on the physical sheet. Since r > z, re[tk(r~z)] < 0, 
and the integral converges for any complex E. For the incident wave in Y, 
proportional to e‘*?, the exponential factors cancel in the corresponding part 
of the integral, so that this part also converges. 

The function 4 in the integral (129.3) is uniquely defined for all complex 
E as the solution of Schrédinger’s equation which contains, in addition to the 
plane wave, only a part which is damped as r -> œ. The whole of the con- 
vergent integral (129.2) is therefore uniquely determined also, and so its 
singularities can arise only through y's becoming infinite. This occurs at 
discrete energy levels.t 

It is also easy to see that f(0,E) remains finite as |E| > œ. For large 
|Z| the term in U can be neglected in Schrédinger’s equation (129.1), so that 
only the plane wave remains in ws: y ~ e*2. Thus the integral (129.2) 
becomes 


0 = | vay 
f(0,0) = Erl i 


which agrees, as it should, with the Born amplitude (126.4) for scattering 
through zero angle (¢ = 0); we denote it by Fx(0). 

Thus we conclude that the scattering amplitude for scattering angle zero is 
regular over the whole physical sheet (including infinity), except for the 
necessary poles on the left half of the real axis at the discrete energy levels. 

Let us consider the integral 


1 (fQE)-fe p 


2mi E'—E 
C 


; (129.4) 


taken along the contour shown in Fig. 46, which consists of an infinitely 
distant circle and an indentation round the cut along the right half of the 





t To avoid misunderstanding, we should emphasize that here we are discussing the 
complete wave function 4 of the system, normalized by the condition that the coefficient of 
the plane wave in its asymptotic expression should be equal to unity (cf. 123.3)). In the 
previous section we were considering the parts p: of the wave function which correspond to 
definite values of J, and ý; was assumed to be normalized in some arbitrary manner. If we 
expand the complete function ¢ in terms of the functions Yi, the latter will appear in 4 with 
coefficients proportional to 1/B;. For example, the function (128.3) with / = 0 must appear 
in ¢ in the form 


constant 1 9 
à FG + B)eitr— 278 sin kr]. 





Hence ¢ becomes infinite at the zeros of the functions Bi(L£), i.e. at the discrete energy levels. 
t The idea of the foregoing proof is due to L. D. Faddeev (1958). 
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© 


Fic. 46 


real axis. The integral along the circle is zero, since {(0,00)—fp = 0. The 
integration along the two sides of the cut gives 


1 T imf(0,E" 

i AOE) pr 
TT E'—E 

here we have used the fact that, according to the definition adopted in §128, 
the physical scattering amplitude for real positive values of E is given on 
the upper side of the cut, and has the complex conjugate value on the lower 
side. 

According to Cauchy’s theorem, the integral (129.4) is equal to the sum of 
f(0,E)—fs and the residues Ry of the integrand at all the poles E’ = En of 
the function f(0,E')/(E' — E), where En are the discrete energy levels. These 
residues əz ; determined by formula (128.17), and are 
dn hAon? 


f dn = —(—1)*(2h,+1) 


R, = 
"~ EE 2m 








(129.5) 


where ln is the angular momentum of the state with energy En. Thus we find 


1 T imf(0,E’) 


dn 
0,£) = fe+—- | ————dE'+ —_——-. 129.6 
AE = fat | ee 2 TE, (129.6) 


This dispersion relation determines f(0,E) at any point on the physical sheet 
from the values of its imaginary part for E > 0(D. Y. Wong 1957, N. Khuri 
1957). 

When the point E tends to the upper side of the cut, the integral along the 
real axis in (129.6) must be taken by passing below the pole £" = E; if this 1s 
done along an infinitesimal semicircle (Fig. 47), the corresponding part of the 


© 
E 
e—a 
O 
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integral gives iim f(0,E) on the right-hand side of (129.6), while the remain- 
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ing integral from 0 to co must be taken as a principal value. The result is 


1_ ‘Pim f(0,E’) dn 
0,E) = +p | =o ae" a 
re f(0,E) = fa + - eet r (129.7) 





which, for E > 0, determines the real part of the scattering amplitude for 
scattering through angle zero from its imaginary part. It may be recalled 
that the latter, according to (125.9), is directly related to the total scattering 
cross-section. 


§130. The scattering amplitude in the momentum representation 


The concept of the scattering amplitude involves only the directions of the 
initial and final momenta of the particle undergoing scattering. It can there- 
fore naturally be approached also by formulating the scattering problem in 
the momentum representation, where there is no question of the spatial 
distribution of the process. We shall now show how this may be done. 

First of all, let us transform to the momentum representation the original 
Schrédinger’s equation 


— Awe) +[U(t)— EW(r) = 0, (130.1) 
2m 


changing from coordinate to momentum wave functions, 1.e. to the Fourier 
components 


a(q) = Í Y(rje~ dy, (130.2) 


Conversely, 


y(r) = f a(qye** d3g/(27)8. (130.3) 


We multiply equation (130.1) by e~'™" and integrate over dV’. In the first 
term, a repeated integration by parts gives 


f e~a Ay(r)d V = { W(r)Ae4* dV = —q2a(q). 
In the second term, substituting (130.3) for P(r), we obtain 


| Ue)y(x)e** av = | f U(rje“9a(q')e't* dV d3q'/(27)3 


= | U(q—q’)a(q’)d8q'/(27)8, 
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where U(q) is the Fourier component of the field U(r): 


U(q) = i U(r)e-i9* AV. 


Thus Schrédinger’s equation in the momentum representation becomes 





h2 2 . ; , d3 d 
(SE-B )a(q)+ | Ula-aa@sS = 0. (1304) 


Note that this is an integral, not a differential, equation. 
The wave function describing the scattering of particles with momentum 


hk has the form 


u(r) = e= + xlr), (130.5) 


where yą(r) is a function whose asymptotic form (as r +o) is that of an out- 
going spherical wave. Its Fourier component is 


a,(q) = (27)%8(q — k) + x(q), (130.6) 


and substitution in (130.4) gives the following equation for the function 
Xu(Q) =F 


P ae- gala) = Ula-W+ f Ula- axla )dg 2r). (130.7) 


This equation may conveniently be transformed by using instead of y,(q) 
another unknown function defined by 


2m F(k, q) 


Zz 130.8 
h2 g2—k2—i0 oe 


x(q) = 


This eliminates the singularity at q? = k2 in the coefficients of equation 
(130.7), which becomes 


The term 70, which denotes the limit of 78 as 6>+0, is included in the 
definition (130.8) in order to give a definite sense to the integral in (130.9), 


+ For convenience of notation, we write q as an argument of the Fourter component instead 
of as a subscript 

t According to the properties of the delta function, the product (g? —k*)8(q—k) gives zero 
when multiplied by any function f (q) (not having a singularity at q = k) and integrated over 
d3g. In this sense, (¢?—k*)8(q —k) =0. 
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since it establishes the manner of passage round the pole q’? = k? (cf. §43). 
We shall show that this manner of passage in fact corresponds to the required 
asymptotic form of the function 


tm F(k, -gyer" d% 
KE) = Fe | oe pei (ane 





(130.10) 


To do so, we write d°q = q? dq do, and first integrate over do,, i.e. over 
the directions of the vector q relative to r. An integration of this type has 
already been effected in transforming the first term in (125.2); in the region 
of large 7, the result is 





ie _ 2m dni f| F(k, qn‘)e' — F(k, —qn')e*@ q dq 
ne hy g? —k2—10 T 
where n’ = rjr, or 
_ n F(k, gn')jeiar q dg 
Xul) = io | g@—k-i0 ` 


The integrand has poles at the points q = k+70 and ¢ = —k—1i0; the path 
of integration in the complex g-plane passes respectively below and above 
these (Fig. 48a). The path can be moved slightly into the upper half-plane 


© EP OO ee 
e L ne —— .}+---- 
(oa) ( b) 
Fic. 48 
and replaced by a straight line parallel to the real axis together with a closed 
loop round the pole q = k (Fig. 48b). The integral along the straight line 
tends to zero as r > œ (because the integrand contains the factor e~"'™ 1), and 
the integral round the loop is given by 2777 times the residue of the integrand 
at the pole g = k. The final result is 


m eikr 
X(t) = = ——F(kn, kn’), (130.11) 
27h* r 


where n is a unit vector in the direction of k. We have derived the required 
asymptotic form of the wave function, and the scattering amplitude is 


m 


2h? 





f(n, n’) = F(kn, kn’). (130.12) 


Thus the scattering amplitude is determined by the value at q = k of the 
function F(k, q) which satisfies the integral equation (130.9). 
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When perturbation theory is applicable, equation (130.9) is easily solved by 
iteration. In the first approximation, omitting the integral term, we have 
F(k,q) = — U(q—k). In the next approximation, we substitute this in the 
integral term; the scattering amplitude (130.12) is then found to be (with a 
slight change of notation) 


U(k’ -k”)U(k” —k) d3k” 


m 2m 
me Sole i ky 
ISP) aU ) A kaak 2410 (27) 


l, (130.13) 


withk = kn, k' = kn’. The first term is the same as (126.4) in the first Born 
approximation; the second term shows the contribution of the second 
approximation to the scattering amplitude.T 

From (130.13) we have the result already mentioned in §126: even in the 
second approximation, the scattering amplitude does not have the symmetry 
property (126.8). At first sight it may seem that the integral term in (130.13) 
is also symmetrical with respect to interchange of the initial and final states. 
Actually, however, this symmetry does not exist, because the path of in- 
tegration and the direction of passage round the pole are altered when the 
complex conjugate expression is taken. 


§131. Scattering at high energies 

If the potential energy is not small compared with h? jma? (a being as usual 
the range of action of the field), a situation can uccur where the energy of the 
particles undergoing scattering is so large that 


| U| <E~(h?/ma?)(ka)?, = enin 
yet the condition 


| U| 2(A2/ma?)ka = hvja (131.2) 
holds; here it is, of course, assumed that 
ka> 1. (131.3) 


In such a case we have scattering of fast particles to which the Born approxi- 
mation is not applicable; neither of the conditions (126.1) and (126.2) is 
satisfied. 

To examine this case, we can use the expression for the wave function in 


the form (45.9): 


Z 


y = etkzF(r), F(r) = ep( -| Uda); (131.4) 


=- © 


t This result can, of course, also be easily obtained without recourse to the momentum 
representation: the fact that the second-approximation formula differs from the first- 
approximation formula by the replacement of U(k’—k) by the expression in the braces 1n 
(130.13) is evident from a comparison of (43.1) and (43.6). 
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for this to be applicable, the energy need only satisfy the condition |U| < E. 
In §45 it has been noted that this expression is valid only forz < ka®, and so it 
cannot be immediately extended to distances where the asymptotic expression 
(123.3) holds. This is not necessary, however; to calculate the scattering 
amplitude, it is sufficient to know the wave function at distances z such that 
a & z < ka?, and the integral in the exponent of F(r) can be extended to 
infinity: 


p = eikzS(p), (131.5) 


with the notation 


S(p) = expl2i8(p)}, (0) = — -+ | Udz, (131.6) 


p being the radius vector in the xy-plane. 

The scattering of fast particles takes place mainly through small angles, 
and these will be considered here. The change in the momentum fq is 
relatively small (q < k), and hence the vector q may be regarded as being 
perpendicular to the wave vector k of the incident particle, i.e. as lying in the 
xy-plane. The scattered wave is obtained by subtracting from (131.5) the 
incident wave e?*2 (the function (131.4) for z = — œ). The amplitude for 
scattering with wave vector k’ = k+q is proportional to the corresponding 
Fourier component of the scattered wave: 


f ~ | [S(p)— le“ dp 


(d?p = dx dy). The proportionality coefficient in this expression can then 
be derived by comparison with the limiting case of the Born approximation 
(see below). 

The calculation can also be made by a different method, which leads 
directly to a completely definite expression. For this, we use (129.2) and 
substitute y from (131.4). Since, according to (45.8), 


(2m/h2)UF = 2ikdF/éz, 
we obtain for the scattering amplitude (the coefficient of e#*®,/Ro) 


k | 6F » 
p= PEA | eae et dx dy dz 


= al [F(z = o)—F(z = — o)je—*" dx dy. 


+ This method of determining the scattering amplitude is analogous to the one used in the 
discussion of Fraunhofer diffraction (Fields, §61). Diffraction effects make formula (131.4) 
inapplicable for z > ka?. 
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Substituting the expression for F, we have finallyT 


w A] [S(p)— Te“? d2p. (131.7) 


271 


If the energy is so high that 5~|U|a/hv < 1, the Born approximation iS 
valid: expanding S—1 ~ 216, we obtain from (131.7) 


= pe Vested’ pide. 
27h 


in accordance with (126.4). 
Using the optical theorem (125.9), we can derive the total scattering cross- 


section from (131.7). The amplitude for scattering at zero angle is the value 
of f for q = 0. We thus have 


o = f 2re(1-S)d2p = f 4 sin? 8(p) dp. (131.8) 


The integrand may be regarded as the scattering cross-section for particles 
with impact parameter in the range d?p.f 

Formula (131.7) does not presuppose that the field has central symmetry. 
It is instructive to see how this formula can be derived, for a centrally 
symmetric field, directly from the exact general formula (123.11). 

Under the conditions (131.1)-(131.3), the main role in the scattering is 
played by the partial amplitudes with large angular momenta /. The quasi- 
classical condition is therefore satisfied for the wave functions, and we can 
use formula (124.1) for ô. Putting there ro ~ ljk, r? = 224 ]2/k2, we find 


mf Uar __m ( 24 [2/h2)}d 
8 © —ae | Te -PPP) im [uive ste 
lik 0 


+ In the two-dimensional case, the scattering amplitude in the field U(x, z) is determined, as in 
§123, Problem, by the analogous formula 


poh IE [ts — lle~™ dx. (131.7a) 
i 27 


The square | |? do is the scattering cross-section per unit length along the y-axis, and ¢ is the angle 
of scattering in the xz-plane; cf. also §126, Problem 6. 


t In §152:a generalization of formulae (131.7) and (131.8) to the case of scattering by a 
system of particles will be given. 
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in accordance with the value of 6(p) (131.6) when p = J/k.t Next, for small 


angles (0 < 1) the Legendre polynomials with large / can be put in the form 
(49.6): 


2n 
Pi(cos 8) = Jo(8l) = A eres ate 
TT 


0 


Substituting this in (123.11) and changing from summation (over large l) to 
integration, we obtain 


2yr 
fs =| | fe c © dd Id! 
TT 
0 


Re | 
aes | fie >P d2p, (131.9) 
TT 


where q and p are two-dimensional vectors with magnitudes g = RO, p = IJR. 
Lastly, substituting for fı in the form (123.15) with 6; = 8(//k), we return 
to (131.7). 

For scattering in a centrally symmetric field, formula (131.7), after the 
integration over the polar angle ¢ in the xy-plane (d?p = p dp dd) has been 
carried out, becomes 


f = —ik | {exp[2i8(p)]-1}Jo(gp)p dp. (131.10) 


It has already been mentioned in §126 that the Born approximation is not 
applicable to the scattering of fast particles through large angles if the cross- 
section is exponentially small. The method given here is also inapplicable 
under these conditions. Such cases are actually quasi-classical, and pertur- 
bation theory cannot be used. 

In accordance with the general rules of the quasi-classical approximation 
(cf.§§52, 53), the exponent in the exponential law of decrease of the scattering 


t The quasi-classical function 248(p) is the change in the action, caused by the field U, 
when the particle traverses a classical path. For a fast particle, this path may be taken as a 
straight line, and 28(p) is then the difference of the classical action integrals 


2mU T m F 
IRI z) fre mm | Ue 
-®© Tw —“to 


In this sense, the function 28(p) here acts like the eikonal in geometrical optics. The approxi- 
mation in scattering theory is therefore often called the etkonal approxtmation. It must be 
emphasized, however, that the scattering amplitude does not reduce to its quasi-classical 
value, since the conditions 6] > 1, 5; > 1 are notin general satished. 
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cross-sections can be determined by considering “complex paths” in the 
classically inaccessible region of motion.t 

In the classical scattering problem the relation between the angle of devia- 
tion @ of the particle in a field U(r) and the impact parameter p is given by 


p dr 


72/1 — p%/r2—UJE) (131.11) 


Hn F6) Si 


To 
where 79 is the minimum distance from the centre, a root of the equation 
1—p%/r?— UJE = 0; (131.12) 


see (127.5). The case of interest to us corresponds to the range of angles 
which cannot occur in the scattering of a classical particle.} \ These angles 
therefore correspond to complex solutions p(@) of equation:(131.11)| (with 
corresponding complex values of ro). From the function p(@) thus found and 
the classical orbital angular momentum mvp of the particle we calculate the 
action 


S(6) = mo Í p(0) dd, (131.13) 
where v is the velocity of the particle at infinity. The scattering amplitude is 


fr exp(— = im S(0)) (131.14) 


Equation (131.12) has in general more than one complex root. The value of ro 
in (131.11) must be taken as that root which gives the smallest positive 
imaginary part im S. In addition, if the function U(r) has complex singu- 
larities, they must also be considered as possible values of ro. | 

The region r ~ ro is the most important in the integral (131.11). For large 
energies E, the term UJE under the radical can be omitted. Carrying out the 
integration, we then have 


Pp = To cos $ô. (131.15) 


If ro is a singular point of the function U(r), it depends only on the pro- 
perties of the field, but not on p or E. Calculating S from (131.13), we find 
in this case that the scattering amplitude is 


Dmu 





f~ exp( - sin $8 imra). (131.16) 


+ A discussion of the coefficient of the exponential is given by A. Z. Patashinskii, V. L. 
Pokrovskii and I. M. Khalatnikov, Soviet Physics JETP 18, 683, 1964. 

t The method described here is valid not only for large E but generally for all cases of 
exponentially small scattering. 

i| It may be recalled (see §126) that, if U(r) has a singularity for real r, the decrease of the 
croass-section is not exponential. 
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If, however, ro has to be taken as a root of equation (131.12), the form of the 
exponent depends on the particular properties of the field. For example, 


with the function 
U = Upe-(r/e) 


(which has no singularity at a finite distance) we obtain from the equation 


UJE = 1— p2/r2 = sin2h6 
the result 
To = iay log([E/Up] sin? $6). (131.17) 


Owing to the very slight dependence on 0, ro may be regarded as constant in 
the integration in (131.13), and we find for the scattering amplitude the 
formula (131.16) with 79 given by (131.17). 


PROBLEMS 


PROBLEM 1. Determine the total scattering cross-section for a spherical square potential 
well of radius a and depth U, with the condition (131.1): Uy < ħ?k?]m. 


SOLUTION. We have 
Í Ud: = -2Uov(a?— p?). 
According to (131.7), the forward-scattering amplitude (q = O)is 


~z, | [ee (== v(a? — e) — | . 27n p dp 
2n hv 
0 


i 
~ika? Í (eĉi»z — 1)x dx 


0 


i] 


f (0) 


ll 


ez v 


r i iv 
= im [i- —- (e -1)], 


where v = U,a/hv is the “Born parameter’. By means of the optical theorem (125.9), we 
hence find the total cross-section: 


a 2mat| 1 ae _ sin 2v _ COS >| 


2ve v 2v2 








In the limiting (Born) case v < 1, this gives o = 27a*v?, in agreement with §126, Problem 1. 
In the opposite limiting case v > 1, we have simply o = 27a?, i.e. twice the geometrical 
cross-section. The latter result has a simple significance. For v > 1, all particles with impact 
parameter p <a are scattered, i.e. are removed from the incident beam. In this sense the well 


behaves as an “absorbing” sphere; and, according to Babinet’s principle (see Fields, end of 
§61), the total cross-section is twice the “absorption” cross-section. 


PROBLEM 2. The same as Problem 1, but fora field U = Ue exp(—r*/a’), 
SOLUTION. In this case 


x 


f U dz = ar/nl'o exp (— p®/a2). 


-% 
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Substituting in (131.7) and making an obvious change of variable in the integral, we obtain 


for the scattering amplitude at zero angle 


vda 
f(0) = —}ika? f (e-tu —1) dufu, 


where again vy = U,a/fiv. Hence the total cross-section is 


vf 
o = 2ra? f (1 —cos u) du/u. 
o 


For v < 1 the integrand is łu and the cross-section o = $ra?»?, in accordance with the 
result of §126, Problem 2 (for ka > 1). For v > 1, we write the integrand as (1 —e7 4# cos u)/u 
with a small parameter 4, which afterwards is made to tend to zero. Integration by parts then 
gives 


vf® a 
Í (1 — cos u) duju = log{v y r)-— f log u sin u du 
0 0 


= log (vn) +C, 


where C is Euler’s constant. Thus 


a = 2ra? log (y yneC) for »> 1. 


PROBLEM 3. Determine the cross-section for small-angle scattering of electrons in a 
magnetic field concentrated in a cylindrical region of radius a (Y. Aharonov and D. Bohm 
1959). 


SOLUTION. Let the magnetic field be along the y-axis, which is also the axis of the cylin- 
drical region, and let the direction of incidence of the electrons be taken as the z-axis. Then 
the scattering is independent of the coordinate y, and we can consider a two-dimensional 
problem in the xz-plane. 

Outside the cylindrical region, the field H = 0, but the vector potential is not zero: 


A = (®/27)Vġ, (1) 


where ¢ is the polar angle in the xz-plane and @ the magnetic flux; for, integrating over the 
area of a circle (with radius r >a) in this plane, we have 


J H dx dz 


$A . dl = [Od/2n}" = Q. 


The potential (1) changes the phase of the electron wave function (plane wave); according to 
(111.9), we have 


Yy = etk? exp (1eDd/2rhc). (2) 
This expression, however, is inapplicable in a narrow region along the half-axis z> 0, since 
the motion of the particles that have passed through the field region is perturbed by the field. 
This explains the apparent non-uniqueness of the function (2) when the angle ¢ increases by 
2m as we pass round the origin. In reality, there is a cut (of finite width) near the half-axis 
z>0, resulting from the invalidity of formula (2); on the two sides of the cut, ġ has values 
differing by 27, for example + 7. 

For scattering at small angles @ with a small momentum transfer gwk@ (qa € 1, 0 < 1), 
teanaverse distances x~ 1/q B a are important, and the width of the cut may be neglected. 
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Considering the region z > |x|, we can also neglect the dependence of 4 on x on either side 
of the z-axis, obtainingt 


yb = AFl), F(x) 


exp( —te0/2hc), x >0, 


| (3) 
exp(zeD/2hc), x< 0. 


The “two-dimensional” scattering amplitude is calculated from formula (131.7a).J For 


g#0 we have 
l jk ieD ae 
f =F dine exp are e~" dx + complex conjugate }. 
0 


The integral is calculated by including a factor e~42 and then taking the limit A>0. The 
result is 


fab joes 
n 2ħc 
Hence the scattering cross-section is 
age iy? de-= Z sin? = 6/62. (4) 
For e/he < 1, we obtain 
e? dé 


T DeRh2ee 62” 


which corresponds to the case where perturbation theory is applicable. 

Note the periodic dependence of the cross-section (4) on the magnetic field strength, and 
the divergence of the total cross-section when 8—0, although the field is concentrated in a 
finite region of space. These are both specifically quantum effects. 


§132. The scattering of slow particles 


Let us consider the properties of elastic scattering in the limiting case 
where the velocities of the particles undergoing scattering are so small that 
their wavelength is large compared with the radius of action a of the field U(r) 
(i.e. ka < 1), and their energy is small compared with the field within that 
radius. The solution of this problem requires an elucidation of the limiting 
form of the dependence of the phases 5; on the wave number k when the 
latter is small. 

For r < a we can neglect only the term in A2? in the exact Schrédinger’s 
equation (123.7): 


Ry" +2Ry'/r—Ul41)Rify? = 2mU(y)Ry| k2. (132.1) 


In the rangea <r <1 /k, on the other hand, we can also omit the term in 
U(r), leaving 
Ry +-2Ry jr—1(1+ 1)R,/r? =, (132.2) 





t Formula (3), like (131.4), 
important. _ 

t This formula (for q0) can, as already me 
Schrédinger’s equation in the potential field. 


is not applicable for very large z, when diffraction effects become 


ntioned, also be derived without using 
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The general solution of this equation is 


Ry = arl+eefri +, (132.3) 


The values of the constants c, and cg can in principle be determined only by 
solving equation (132.1) for a particular function U(r); they are, of course 
different for different J. At still greater distances, r ~ 1/k, the term in 
U(r) can be omitted from Schrédinger’s equation, but the term in k? cannot 
be neglected, so that we have 





2 (I41 
E E A E - Jr =0, (132.4) 
y Y 


i.e. the equation of free motion. The solution of this equation is (see §33) 





(21+1)!! d \?sinkr 
pE kel +l (5 PS 
rl d \! coskr 


The constant coefficients have been chosen so that, for kr < 1, this solution 
becomes (132.3); this ensures the “joining” of the solution (132.3) in the 
region kr < 1 to the solution (132.5) in the region kr ~ 1. 
Finally, for kr > 1 the solution (132.5) takes the asymptotic form (§33) 
sin(kr —}l7) cok! 


zs a =A), 
Rı c(2l+ 1) ETIR Tace ” 41:7) 


This expression can be put in the form 
sin(kr—ġlr+ ô) 


r 


Rı = constant x 


(132.6) 
where the phase ô, is given by the equation 
tan 8; æ $r = caka (21—1)! (2141)! (132.7) 


(since k is small, all the phases 6; are small). 
According to (123.15) the partial scattering amplitudes are 


1 
fi oa ) = ôi 


and so we conclude that in the limit of low energies 
fr ~ Ret. (132.8) 


Thus all the partial amplitudes with / # 0 are small compared with the 
scattering amplitude with Z = 0 (called s-wave scattering). Neglecting them, 
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we obtain for the total amplitude 


fO) = fo = Sofk = cele) = —a, (132.9) 
so that dg = «? do, and the total cross-section is 
o = 47a’, (132.10) 


At low velocities the scattering is isotropic, and the cross-section is inde- 
pendent of the particle energy.t The constant a is called the scattering 
length; it may be either positive or negative. 

In the above discussion it has been tacitly assumed that the field U(r) 
decreases at large distances (r > a) sufficiently rapidly for the approximations 
made to be legitimate. It is easy to see how rapidly U(r) must in fact decrease. 
For large r, the second term in the function R; (132.3) is small in comparison 
with the first. In order for the retention of this term to be nevertheless 
legitimate, the small terms ~cg/r!+1r? retained in equation (132.2) must 
be large compared with the term UR; ~ Ucyr! omitted in going from (132.1) 
to (132.2). Hence it follows that U(r) must decrease more rapidly than 1 /r2!+3 
if the result (132.8) is to be valid for the partial amplitude fı. In particular, 
the calculation of fo, and therefore the result (132.9) of isotropic scattering 
independent of energy, are valid only when U(r) decreases at large dist- 
ances more rapidly than 1/7°. 

If the field U(r) decreases exponentially at large distances, we can draw 
certain conclusions regarding the nature of the subsequent terms in the 
expansion of the amplitudes fı in powers of k. We have seen in §128 that in 
this case the amplitude fı, regarded as a function of the complex variable 
E, is real when E is real and negative.{ The same is therefore true of the 
function g;(£) in the expression (125.15): 


fi = 1/(g:—ik) 


(ik is real for E < 0). The function g)(E) is also real (by definition) when 
E > 0. Thus this function is real for all real E, and can therefore be expanded 
in integral powers of E, i.e. in even powers of k. The amplitude fi(k) itself, 
therefore, can be expanded in integral powers of ik; all terms with even 
powers of k are real, while those with odd powers of k are imaginary. Accord- 
ing to (132.8) the expression of f;(k) begins with the term ~6)/k ~ k2!; 
accordingly, the expansion of g;(k) begins with a term proportional to k2. 

When the field decreases at large distances according to a power law 


U = r-r with n < 3, the result (132.9) that the amplitude is constant is, 
as already stated, invalid. 


t In the scattering of electrons by atoms, the length a with which 1/k must be compared (the 
condition ka < 1) is represented by the radius of the atom, which is several times the Bohr 
radius (several times #?/me*) for complex atoms. Owing to the large value of this radius, the 
constancy of the effective cross-section actually applies here only up to energies of the Ka des 
of fractions of an electron-volt; at greater electron energies there is a marked energy 
dence of the cross-section (called the Ramsauer effect). 

Í For small E, the condition (128.6) is satisfied even when U decreases as e-7/2, 


depen- 
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Let us now consider the situations which occur for various values of n. 
For n < 1 and sufficiently small velocities, the condition 


o|U(p)| > he (132.11) 


is satisfied for practically all values of the impact parameter p, and so the 
scattering is described by the classical formulae (cf. the condition (127.9)). 

For 1 < n < 2 the inequality (132.11) is satisfied over a considerable 
range of fairly small values of p; accordingly, the scattering is classical for 
angles which are not too small. There is also, however, a range of values of 
p for which 


p|U(p)| < ho, (132.12) 


i.e. the condition for perturbation theory to be valid is satisfied (cf. (126.2)). 
For n > 2 the inequality 


[U| < h?/mr? (132.13) 


holds at large distances, and therefore the contribution to the scattering 
which arises from interaction at these distances can be calculated by means of 
perturbation theory (whereas at smaller distances the condition for perturba- 
tion theory to be applicable may not be satisfied).t Let ro be a value of 7 
such that for r > ro the inequality (132.13) holds, while ro < I/k. The 
contribution to the scattering amplitude from the region r > ro Is, according 
to (126.12), given by the integral 


(sa) : (sa) : 
2mB f 1 sin Me "TEND 2mß soi sin $ 
he rn qr he &n—l 


To QTro 


Ce ei ] —— 





dé. (132.14) 





For 2 < n < 3 this integral converges at the lower limit, and for low 
velocities (kro < 1) we can replace this limit by zero, so that the integral 
is proportional to g~@-), i.e. a negative power of the velocity. This contri- 
bution to the amplitude is therefore in this case the main one, so that 


fuort, Jen}. (132.15) 


This determines the dependence of the scattering cross-section on the velocity 
of the particles and on the angle of scattering. 

Forn = 3 the integral (132.14) diverges logarithmically at the lower limit. 
It is still the main part of the scattering amplitude, so that 


f~ log(constant/g), 7 = 3. (132.16) 


For n > 3 the contribution from the region r > ro decreases as k +0, 
and the scattering is determined by the constant amplitude (132.9)). However, 
the contribution (132.14), despite its relative smallness, 1s still of some 
interest through being “anomalous”. The “normal’’ situation when U(r) 


+ The scattering at low velocities ts in this case nowhere quasi-classical, since the inequality 
(132.11) is incompatible with the simultaneous requirement that | Ulol < E. 
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decreases sufficiently rapidly is that f(k) can be expanded in integral powers 
of k, and all the real terms in the expansion are proportional to even powers 
ofk. When the integral (132.14) is ‘ntegrated several times by parts (lowering 
the power of £ in the denominator), we can separate from it a part containing 
even powers of k and leave an integral convergent as gro —> 0 and proportional 
to k”-3, which is not in general an even power.t 


PROBLEMS 
PROBLEM 1. Determine the scattering cross-section for slow particles in a spherical square 
potential well of depth U, and radius a. 


SOLUTION. The wave number of the particle is assumed to satisfy the conditions ka < 1 
and k <x, where x = 4/(2mUo)/h. We are interested only in the phase ôo. Hence we put 
= 0 in equation (132.1), and obtain for the function x(r) = rR,(7) the equation 


x” +r’ = Oforr <a. 
The solution which vanishes at r = 0 (x/r must be finite at r = 0) is 
x = Asin«r (r <a). 


For r > a, the function y satisfies the equation x” +k?°x = 0 (i.e. equation (132.4) with 
l = 0), whence 


x = B sin(kr +80) (r >a). 
From the continuity of y’/x atr = a, we obtain the equation 
x cot ka = k cot(ka + ôo) œ k/(kat+ ôo), 


from which we determine ĝo. As a result, we have for the scattering amplitude} 


tan kä Ka 


K 


For xa €1 (i.e. Uo < }?/ma?) this formula gives o = (4ra?/9)(xa)t, in accordance with 
the result of the Born approximation (see §126, Problem 1). 


PROBLEM 2. The same as Problem 1, but for scattering by a spherical ‘‘potential hump” 
of height Ulo 


SOLUTION. The solution is obtained from that of Problem 1 if we change the sign of U, 
(which means replacing x by ix), and obtain for the scattering amplitude 


tanh «a— Ka 


K 
In the limit «a > 1 we have 


f= — a, o = ģna. 


t If 7 is an odd integer 2p +1, then n—3 = 2p—2 is an even number. In this case also 
however, the integral (132.14) has an “anomalous” part, which gives a contribution to the 
scattering amplitude proportional to g?? —? iog q. 

t This formula becomes inapplicable if the width and depth of the well are such that xa is 
close to an odd multiple of 47. For such values of xa the discrete spectrum of negative energy 
levels includes one which is close to zero (see §33, Problem 1), and the scattering is described 
by formulae which we shall derive in the next section. 
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This corresponds to.scattering from an impenetrable sphere of radius a; we note that classical 
mechanics would give a result four times smaller (o = 7%). 


PROBLEM 3. Determine the scattering cross-section for particles of low energy in a field 
U = aj", d >00 n> 3. 


SoLtuTion. Equation (132.1) with / = 0 is 
x yx = 0, y = vV/(2ma)/h. 
By the substıtutions 
x SGT, r = [2yi(n—2)x]?in-3 


it can be brought to the form 


d*4 Id 
a2 a l rk = QO, 


dx? xdx (11 — 2)2x2 


i.e. Bessel’s equation of order i/(mn—2), with imaginary argument ix. The solution which 
vanishes at ry = 0 (i.e. x = œ) is, apart from a constant factor, 


x = VrHyn-2\(2iyr-22|(n—2)). 
Using the well-known formulae 


HyV(z) = tle?"] p(z)—J-n(2)]/sin pr, 
J2) = 27/270 (p+) (& <1), 


we obtain for the function y at large distances (y <r <1/k) the expression 
x = constant X (cır +ca), and from the ratio cz/¢, we find the scattering amplitude 


eV 0 3)/(2—2)] 
f= E P(r —1)i(n—2)] 


Proptem 4. Determine the scattering amplitude for slow particles in a field which 
decreases at large distances as U % pr” with 2 <n < 3. 


SoLuTION. The principal term in the scattering amplitude is given by the expression 
(132.14), in which the lower limit in the integral can be replaced by zero. The calculation 
of the integral leads to the result 


amp gn-s 
= ; PAS ie, We oa (1) 
R2 T(n—1)cos han 











and for n = 3 
2m3 constant 


=~ log — 
f nee 








(2) 


Expanding (1) in Legendre polynomials, we obtain the partial scattering amplitudes 
(defined in accordance with (123.14): 


qnm Pan-prd-wet+) kn-3 


as (3) 
2h Pan) Gnt342) 


j=- 


For n > 3 the same formula (1) determines the “‘anomalous’’ part of the scattering amplitude. 
In the partial amplitudes the quantity (3) is always the principal part for values of / such 
that 21> n—3, and instead of (132.8) we then have firw kr. 


ProBLEM 5. Determine the scattering amplitude for slow particles in a field U(r) = 
— U, exp(—r/a), Up > 0. 


SOLUTION. After the change of variable 


x = areia, x = of(2mUp)ih, 
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equation (132.1) for the function x = 7R, becomes 


2 
SK l A À 
dx? x dx 


The general solution of this equation is 
x = AJo(x) + BNo(x), 
where J, and No are the Bessel functions of the first and second kinds. The condition x = 0 
for r = Q0 gives 
AJB = —No({2«a)j Jo(2«a). 
The region a < r < 1/k corresponds to x < 1 (it is assumed, of course, that ax exp(—1/ak) 
< 1; here 
Br 


ys A+B log yx = gag Kay — —, 
T T Ta 


where y = eC = 1°78 ... (C is Euler’s constant). This expression corresponds to formula 
(132.3), and from the values of c, and c; thus obtained we find the scattering amplitude 


Te -a(T +2 log kay) 


aT 
J o(2xa) 


In the limit xa < 1, f = 2a?r?, in accordance with the Born approximation (126.14). For 
ka > 1 we have f = —2a log xay. 





| No(2xa) -- log kay Jol2) | 


PROBLEM 6. Determine, in the second approximation of perturbation theory, the scattering 
ampljtude in the limit of low energies (I. Ya Pomeranchuk 1948). 


SOLUTION. For k->0 the integral in the second term of (130.13) becomes 


-| U -w Ure d3k" = - | fumu o Sav dV’ 


k2 
= -2m2 | | COU) ay ap, 
|r- r'| 
here we have used the formula 
gk) 4m dèk _ l 
k2 (27)  |r-r'] 
see Fields, §51. Thus the scattering amplitude is 
m m \2 U(r)U(r') ; 
= ~——| UdV+{(— dV dy’. 1 
J a (2) {| jr—r' (i) 


For a central field, this formula gives 


2 
f = -2| Ur? darn ff O(r)U(r')r? dr . r' dr’. 


The second term in (1) is always positive (as is clear from the original form of the integral in 
k-space). Ina repulsive field (U > 0), the first Born approximation therefore always gives too 
high a value, and in an attractive field (U < 0) too low a value, for the scattering cross-section 
at low energies. 





PROBLEM 7. Determine the energy dependence of the scattering amplitude for slow particles in 
the two-dimensional case. 

So.ution. The wave function at large distances in the two-dimensional case is given by (1) in 
§124. Problem. Arguments similar to those for the three-dimensional case show that the main 
contribution to the scattering at low energies comes from the state with m = 0, so that the scattering 
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amplitude f is independent of the scattering angle ¢. The wave function can then be represented 
at all distances p > a by simply replacing e”/,/p by the exact solution of Schridinger’s equation 
for free motion, which has the same asymptotic form; see the first footnote on page 16l and §126, 
Problem 6. Thus 


Y =e" + f (7k) . iH (kp). (1) 
Here we take small distances p < l/k, using the approximate expression for HÌ! (x) when x is small: 
Hy) (x) œ — (2i/m)log(2i/yx), |x| <1, 


where y = e" and C is Euler’s constant. The result is 


Y [1 + f./(2k /m) log (2i /yk)] — fy (2k jr) log p. (2) 


This agrees, as it should, with the general solution of the equation 


hid dy 
2m p dp P dp > 


valid in the range | /k > p > a where the U(x) and E terms in Schrédinger’s equation are negligible: 


Y œ% ci + cy log p. 


As in (132.3) and (132.9), the ratio of constants c /c. is given by the solution of Schrédinger’s equation 
with £ = 0 in the range p ~a. This ratio is real and independent of the energy. With the notation 


cje = —log ro, (3) 
where 7, is a constant having the dimensions of length, a comparison of (2) and (3) gives 


l 


a ayia)” 


whence the cross-section is 


o = 2n| f1? = (x7/k) (4) 


log? (2/ykry) + in? 


We see that in the two-dimensional case, unlike the three-dimensional one, the cross-section increases 
with decreasing energy. 

For scattering by an infinitely high cylindrical potential barrier with radius a, the constant 7) in 
(3) is equal to a. 


§133. Resonance scattering at low energies 


Particular consideration must be given to the scattering of slow particles 
(ka < 1) in an attractive field when the discrete spectrum of negative energy 
levels includes an s state whose energy is small compared with the value of the 
field U within its range of action a. We denote this level by « (e > 0). The 
energy E of the particle undergoing scattering, being small, is close to e, 
i.e. it is, as we say, almost in resonance with the level. This leads, as we shall 
see, to a considerable increase in the scattering cross-section. 

The existence of the shallow level can be taken into account in scattering 
theory by means of a formal method based on the following arguments. 

In the exact Schrédinger’s equation for the function y = rRo(with/ = 0), 


x" + (2m/h?)[E— U(r) ]x = 
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in the “inner” region of the field (r < a) we can neglect E in comparison 
with U: 


x” —(2m/h?)U(r)x = 0, r~a. (133.1) 
In the “outer” region (r > a), on the other hand, we can neglect U: 
x" +(2m|h?)Ex = 0, r>a (133.2) 


The solution of equation (133.2) must be “joined” at some r, (such that 
1/k > rı > a) to the solution of equation (133.1) which satisfies the houndary 
condition x(0) = 0; the joining condition is that the ratio y'/y should be 
continuous. This ratio does not depend on the normalization factor in the 
wave function. 

However, instead of considering the motion in the region r ~ a, we apply 
to the solution in the outer region a suitably chosen boundary condition on 
x [x for small r; since the solution in the outer region varies only slowly 
as r -> 0, we can formally apply this condition at the point r= 0. The 
equation (133.1) for the region r ~ a does not contain E; the boundary 
condition which replaces it must therefore also be independent of the energy 
of the particle. In other words, it must be of the form 


[x'/x]r-0 = —«, (133.3) 


where « is some constant. But, « being independent of E, the same condition 
(133.3) must also apply to the solution of Schrédinger’s equation for small 


negative energy E = —|e|, i.e. to the wave function of the corresponding 
stationary state of the particle. For E = —|e| we have from (133.2) 
y = Age mlieh, (133.4) 


where Ap is a constant, and substitution of this function in (133.3) shows that 
k is a positive quantity, 
K = y/(2m|e|)/hħ. (133.5) 


Let us now apply the boundary condition (133.3) to the wave equation for 
free motion, 


= constant x sin(kr + 89), 


which is the exact general solution of equation (133.2) for E > 0. Thus we 
have for the required phase ŝo 


cot do = — x/k 


= -—/(lel/E). (133.6) 


Since the energy E is here restricted only by the condition ka < 1, and need 
not be small compared with |e], the phase Sg and the s-wave scattering 
amplitude may not be small. 

The phases 6; with Z > 0 and the corresponding partial amplitudes are 
again small. Hence we can again regard the total amplitude as being equal to 
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the s-wave scattering amplitude 
1 
J 2ik 


= 1/R(coté9—2), 
Substituting (133.6) we obtain 
f= —1/(x+ik) (133.7) 
and for the total scattering cross-section 


4-7 2rh2 1 
o 4k om Ete] 





(133.8) 


Thus the scattering is again isotropic, but the cross-section depends on the 
energy, and in the resonance region (E ~ |e|) is large compared with the 
squared range of action of the field a? (since ka « 1). The form of (133.8) is 
not affected by the details of the interaction of the particles at small distances, 
and depends only on the value of the resonance level. 

The above formula is somewhat more general than the assumption made 
in its derivation. Let the function U(r) be slightly modified; this alters also 
the value of the constant « in the boundary condition (133.3). By an appro- 
priate change in U(r), x can be made to vanish, and then to become small and 
negative. This gives the same formulae (133.7) for the scattering amplitude 
and (133.8) for the cross-section. In the latter, however, the quantity 
|e] = h®x2/2m is now simply a constant characteristic of the field U(r), and 
not an energy level in that field. In such cases the field is said to have a 
virtual level, since, although there is no actual level close to zero, a slight 
change in the field would be sufficient to cause one to appear. 

In the analytical continuation of the function (133.7) in the complex plane 
of E, ik becomes — 4/(— 2mE)/hi on the left half of the real axis (see §128), and 
we see that the scattering amplitude has a pole at E = —|e|, in accordance 
with the general results of §128. On the other hand, the virtual level corres- 
ponds, as we should expect, to no singularity of the scattering amplitude on 
the physical sheet. (The scattering amplitude has a pole at E = —|e| on the 
non-physical sheet; see the first footnote to §128.) 

Formally, the expression (133.7) corresponds to the case where in the 


expression (125.15), 
1 


f= volk) — ik" 


the first term in the expansion of the function go(A) is negative and anomalously 
small. To refine the formula, we can take account of the second term 1n the 


expansion: 


+ Formula (133.8) was first derived by E. Wigner (1933); the idea of the derivation given 
here is due to H. A. Bethe and R. E. Peierls (1935). 
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1 

= ee 9 
eres a alae 
(L. D. Landau and Ya. A. Smorodinskii 1944); it may be recalled that, 
when the field decreases sufficiently rapidly, the functions g;(k) can be ex- 
panded in even powers of k (see §132). In (133.9) we have denoted by — xo 
the value of go(0), in order to retain the notation « for the quantity (133.5), 
which is related to the energy level «. According to the above discussion, « 
is given by the value of —zk which makes the denominator in (133.9) equal to 

zero, i.e. by the root of the equation 


k = kot rog? (133.10) 


The correction term rok? in the denominator in (133.9) is small compared 
with «xo, since & is assumed small, but it is itself of “normal’’ order of magni- 
tude: the coefficient 79 ~ a and is always positive (see Problem 1). It should 
be emphasized that the inclusion of this term is a legitimate refinement in 
the formula for the scattering amplitude when contributions from angular 
momenta / # 0 are neglected; it gives a correction to f of relative order ka, 
whereas the contribution from scattering with / = 1 is of relative order 
(ka). When k ->0, the amplitude fo >1/xo, i.e. 1/9 is equal to the scattering 
length « defined in §132. The coefficient rọ in the formula 


go(k) = k cotdo 
= —1fe-+}rok (133.11) 


is called the effective range of the interaction.T 
For the cross-section we have, from (133.9), 


ree 


If we neglect the term in k4 in the denominator (though it may legitimately 
be included), this formula can be written (using (133.10)) in the form 
4n(1+rox) 47h? 147 0K 
oc = Ch = 


se (133.12) 
k24 r? m E+|e| 


Let us return to the expression (133.4) for the wave function of the bound 
State in the “outer”? region, and relate the normalization coefficient to the 





t The values of the constants x and rọ may be mentioned for the important case of the 
interaction of two nucleons. For a neutron and a proton with parallel spins (isotopic state with 
T = 0),a = 5:4 x10-" cm, ro = 1:7 X10-™3 cm; these correspond to a true level with energy 
|e] = 2:23 MeV, the ground state of the deuteron. For a neutron and a proton with anti- 
parallel spins (isotopic state with T = 1), x = —24x10-® cm, rg = 2:7 X10-!3 cm; these 
values correspond to a virtual level with |e| = 0-067 MeV. Owing to isotopic invariance, the 
latter values must apply also to a system of two neutrons with antiparallel spins; parallel spins 
of the mm system in the s state are prohibited by Pauli’s principle. 
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parameters defined above. On calculating the residue of the function (133.9) 
at its pole E = «e and comparing with (128.11), we find 


— = —-—}ro. (133.13) 


The second term is a small correction to the first, since xk7g~xa < 1. 
Without this correction, Ag* = 2x, 1.e. 





= a/(2n)e-*F ME Be EG ET 
y = V(2n)e*", y Haale T = (133.14) 


corresponding to the normalization that would occur if (133.14) were valid 
in all space. 

We shall briefly discuss resonance in scattering with non-zero orbital 
angular momenta. The expansion of the function g;(k) begins with a term 
~k; retaining the first two terms in the expansion, we write the partial 
scattering amplitude as 


1 


f= ~ BE“ — e+ E)+ik’ 


(133.15) 


where b and e are two constants, with b > 0 (see below). The case of resonance 
corresponds to an anomalously low value of the coefficient of E~}, i.e. an 
anomalously small e. However, since E is small, the term beE-! may still 
be large in comparison with k. 

If « < 0, the denominator in the expression (133.15) has a real root 
E = —|e|, so that e is a discrete energy level (with angular momentum /),f 
but in contrast to resonance in s-wave scattering the amplitude (133.15) is 
never large compared with a; the amplitude of resonance scattering with 
angular momentum /+1 is only of the same order of magnitude as that of 
non-resonance scattering with angular momentum /. 

If € > 0, however, the amplitude (133.15) becomes of the order of 1/k 
in the region E ~ «, i.e. large compared with a. The relative width of this 
region is small: AE/e ~ (ka)?!-1. Thus in this case there is a sharp resonance. 
This type of resonance scattering occurs because a positive level with Z # 0, 
though not a true discrete level, is quasi-discrete: owing to the presence 
of the centrifugal potential barrier, the probability that a particle of low 
energy will escape from this state to infinity is small, so that the “lifetime” 
of the state is long (see §134). This is the reason why resonance scattering 
with Z Æ 0 is different in nature from that in the s state, where there is no 
centrifugal barrier. The denominator in (133.15) with e > 0 vanishes when 


+ For «<0, and E close to |e|, 
fi = (— 1) Ne|/b(E+ el). 


A comparison with (128.17) shows that 6>0, 
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E = Eo—i. 4I, where 


_ 2v (2m) 


Eo & €, E 
: bA 


eft1i2, (133.16) 


This pole of the scattering amplitude is, however, on the non-physical sheet. 
The small quantity F is the width of the quasi-discrete level (see §134). 
Finally, we may mention an interesting property of the phases ê; which is 
easily derived from the above results. We shall regard the phases 5;(Z) as 
continuous functions of the energy, and not restrict them to the range from 
0 to ~ (cf. the footnote following (33.20)). We shall show that the equation 


5:0) — ê) = 777 (133.17) 


then holds, where is the number of discrete levels with angular momentum / 
in the attractive field U(r) (N. Levinson 1949). 

To prove this, we note that, in a field which satisfies the condition 
|U| < h®/ma?, the Born approximation is valid at all energies, so that 5;,(E) < 1 
for all Z, and 8;,(c) = 0, since for E -> œ the scattering amplitude tends 
to zero, while 6,(0) = 0 in accordance with the general results of §132. In 
such a field there are no discrete levels (see §45), and son = 0. We now 
consider the variation of the difference 5,(A)~— 5;(0), where A is some given 
small quantity, as the potential well U(r) gradually becomes deeper. As this 
occurs, the first, second etc. levels successively appear at the top of the well, 
and the phases 6,(A) are increased by ~v each time.ft On reaching the given 
U(r) and then making A > 0, we obtain formula (133.17). 


PROBLEMS 


PROBLEM 1. Express the effective interaction range ro in terms of the wave function of the 
stationary state E = e in the “inner” region r~a (Ya. A. Smorodinskii 1948). 


SOLUTION. Let xo be the wave function in the region r~a, normalized by the condition 
that xo >1 as roo, Then the square of the wave function can be written in all space in the 
form x? = A,*(e-**’ + yo?—1); this expression becomes A,2e-**" for xr > 1 and Ay?xq? for 
kr <1. It must be normalized by the condition 


© 1 oo 
fe dr = Act — fa-x ar) = 1, 
0 0 


and a comparison with (133.13) gives 


From equation (133.1) with U(r) <0, the solution of wh: 
xs) = i , ch is xo, ıt follows th 


t In formula (133.6) this corresponds to a change of 5) from O to 7 when, for a given small 
value of k, the quantity « changes from a negative value (—x > k) to a positive value x > k 
When /#0, the same follows from the formula k cot 8; = —bE {E —e«e) when, for a Pin 
E = A, e varies from e > Ato —e > A. i g 
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PROBLEM 2. Determine the change in the phases & when the field U(r) is varied. 


SoLuTION. Varying U(r) in Schrédinger’s equation 
,, 2m |e A21(1+ 1) 5 
ae 2mr? |x= ; 
we obtain 


2m | AZIL + 1) 


A 2m 


Multiplying the first equation by §x:, the second by x:, subtracting, and integrating with 
respect to 7, we find 


p 2m i ; 
[xðxı -Xi Dika = pe i x126U dr. 
0 


Substituting on the left-hand side the asymptotic expressions 
xi = sin(kr — ġir — ôt), 


oy! = 8(6;)cos(kr —tlr+ ĉr) 


(the choice of the coefficient 1 in this expression determining the normalization used), we 
obtain 


58 2m F 2 
(ò) = - = | x U dr, 
0 


From this formula we can draw certain conclusions regarding the sign of the phases 
§:, considered as continuous functions of energy. To avoid the ambiguity in the definition 
of these functions (an additive multiple of 7) we shall normalize them by the condition 
8:(0) = 0. 

Starting from U = 0, when all the ê: are zero, and gradually increasing |U], we find that 
in a repulsive field (U >0) all the 6: <0, and in an attractive field (U < 0) 6 >0. In a 
repulsive field &(0) = 0 and therefore, for small energies, the ô: are small; the scattering 
amplitude is therefore negative: f =% 80/k <0. In an attractive field the corresponding 
deduction that f is positive can be made only if there are no discrete levels. Otherwise, when 
E is small, the phases ê; are close to nv, not to zero (see (133.17)), and no conclusion can be 
drawn concerning the sign of f. 


PROBLEM 3. Find the scattering length a and the effective range of interaction ro for a 
spherical square potential well of radius a and depth U, containing a single discrete energy 
leve) near zero. 


SOLUTION. We proceed as in §132, Problem 1, except that in the region within the well 
we do not neglect the particle energy E = h?k*/2m in comparison with Ue. The equation to 
determine the phase &, is found to be 


1 
k cot (8, + ak) = K cot aK, K = F o/[(2n(U, + EN. 
In order that the well should contain only one level, close to zero, it 1s necessary that 
Uo = (72h? /8ma?)(1 + A) 


with A < 1; see 833, Problem 1. Expanding the above equation in powers of ka and A, we 
find that 


¥ ae 
k cot 39 = ——A+ sak?, 
Sa 


whence a = 1/xy = 8a/72A, ro = a. The value of x, coincides, as it should, with that of 
4/(2m|E,])/#, where E, is the energy of the level in the well; see §33, Problem 1. 
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PROBLEM 4. Express the integral 


fv dr 
0 


of the squared wave function of the s state in terms of the phase 8)(k) for a field U(r) that is 
zero outside a sphere of radius a (G. Liiders 1955). 


SOLUTION. According to (128.10), 


: aie ey ay 

Sip a aA 
E me ek 1G ee 
0 


where the prime denotes differentiation with respect to r (and the derivatives with respect to 
E in (128.10) are replaced by those with respect to k = y(2mE})A). Since, at r = a, there 
is no field, we can use on the right-hand side the wave function of free motion, y = 2 sin 
(kr + 8) (normalized as in (33.20)). The result is 


f any L 
2dr = 2| a+ — | —— sin 2(ka + ĉo) > 0. 
fx ( =) pre) 
Since the integral of y* is certainly positive, the expression on the right must also be positive.t 


$134. Resonance at a quasi-discr2.e level 


A system which can disintegrate does not, strictly speaking, have a discrete 
energy spectrum. The particle leaving it when it disintegrates recedes to 
infinity; in this sense, the motion of the system is infinite, and hence the 
energy spectrum is continuous. 

It may happen, however, that the disintegration probability of the system 
is very small. The simplest example of this kind is given by a particle sur- 
rounded by a fairly high and wide potential barrier. Another possible 
reason for metastability of a state is that the spin of the system must change 
in a disintegration due to a weak spin-orbit interaction. 

For such systems with a small disintegration probability, we can introduce 
the concept of quasi-stationary states, in which the particles move “inside 
the system” for a considerable period of time, leaving it only when a fairly 
long time interval r has elapsed; r may be called the lifetime of the almost 
Stationary state concerned (7 ~ 1/z, where w is the disintegration preb- 
ability per unit time). The energy spectrum of these states will be quasi- 
discrete; it consists of a series of broadened levels, whose width is related 
to the lifetime by T ~ fi/r (see (44.7) ). The widths of the quasi-discrete 
levels are small compared with the distances between them. 

In discussing the quasi-stationary states, we can use the following formal 
method. Until now we have always considered solutions of Schrédinger’s 
equation with a boundary condition requiring the finiteness of the wave 
function at infinity. Instead of this, we shall now look for solutions which 
represent an outgoing spherical wave at infinity; this corresponds to the 
particle finally leaving the system when it disintegrates. Since such a 
boundary condition is complex, we cannot assert that the eigenvalues of 


+ This inequality had previously been derived in a different manner by Wigner (1955). 
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the energy must be real. On the contrary, by solving Schrédinger’s equa- 
tion, we obtain a set of complex values, which we write in the form 


f= Eye (134.1) 


where E, and T are two constants, which are positive (see below). 
It is easy to see the physical significance of the complex energy values. 
The time factor in the wave function of a quasi-stationary state is of the form 


e-IMEt — p-(t/MEgte tT M2, 


Hence all the probabilities given by the squared modulus of the wave function 
decrease with time as e~F/)!4 In particular, the probability of finding 
the particle “inside the system” decreases according to this law. Thus I 
determines the lifetime of the state; the disintegration probability per unit 
time is 


w = Th. (134.2) 


At large distances the wave function of the quasi-stationary state (the 
outgoing wave) contains the factor 


exp[ira/ {2m(Eo—4iT)}/A), 


which increases exponentially as r -> œ (the imaginary part of the root 
is negative). Hence the normalization integral { ||? dV for these functions 
diverges. It may be noted, incidentally, that this resolves the apparent 
contradiction between the decrease with time of |4|? and the fact that the 
normalization integral can be shown from the wave equation to be a constant. 

Let us ascertain the form of the wave function which describes the motion 
of a particle with energy close to one of the quasi-discrete levels of the system. 

As in §128, we write down the asymptotic form (at large distances) of the 


+ We may note that this shows the physical necessity for I to be positive, a condition which 
is automatically satisfied on account of the boundary condition imposed at infinity on the 
solution of the wave equation, or by the equivalent (see §130) rule of passage round poles 1n 
the formulae of perturbation theory. Let transitions from the discrete level 7 to the states v 
of the continuous spectrum be caused by a constant perturbation V. Then the second-order 
correction to the energy level is 


Fn]? d» : 
rc) en) ed De alee 
= oss 


cf. (38.10). The rule (43.10) gives 
P= —2im E,'® = 2m | |Van? (En —E,) dv, 


in agreement with (43.1) for the transition probability. 
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radial part of the wave function in the form (128.1): 


1 —2mE —2mE 
R, = -| Ax(E}exp( — Ae) + B(E)exp (=) (134.3) 
r 
and regard E as a complex variable. For real positive £, 
1 
Ry = ~[AE)e* + BE], k = /(2mEh, (134.4) 
r 


and A,(E) = B*(E) (see (128.3), (128.4)); the function B,(E) is here taken 
on the upper edge of a cut along the right half of the real axis. 

The condition which determines the complex eigenvalues of the energy 
consists in the absence of an ingoing wave from the asymptotic expression 
(134.3). This means that for E = Eo— 4T the coefficient B(E) must vanish: 


B(Eo—}iT) = 0. (134.5) 


Thus the quasi-discrete energy levels, like the true discrete levels, are zeros 
of the function BE). However, unlike the zeros which correspond to true 
levels, they do not lie on the physical sheet: in writing the condition (134.5) 
we have assumed that the required wave function of the quasi-stationary 
state arises from the same term in (134.3), which is an outgoing wave ( ~etkr) 
when E > 0 also (in (134.4)). But the point E = Ej—4:T lies below the 
positive real axis. This point can be reached from the upper edge of the cut 
(where the coefficients in (134.4) are defined), without leaving the physical 
sheet, only by passing round the point E = 0. Then y -— E changes sign, 
so that the outgoing wave becomes an ingoing one. Consequently, to pre- 
Serve the outgoing wave the point must be reached by going directly down 
through the cut, on to another, non-physical, sheet. 

Let us now consider real positive energy values close to the quasi-discrete 
level (assuming, of course, that I’ is small, since otherwise no such close 
values could exist). Expanding the function B,(E) in powers of the difference 
E'—(£o—3:1) and taking only the first-order term, we have 


where b; is a constant. Substituting in (134.4), we obtain the following 
expression for the wave function of a state close to the quasi-stationary state: 


1 
Ry = ~[(E—Eo— HT )br*etkr 4 (E— Eo + HD )bje-tkr], (1347) 


The phase 6, of this function is given by 


E— Eo—4iT 
E—Eot+#iT 


= eun(1 = _) i 
E— Eo+4iT 7” (LR) 


e2té: — e2td,'°) 
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where 
ei — (— 1)+15*/bp. (134.9) 


For |E — Eo| > I, the phase ô; is equal to 81, so that 6; is the value of the 
phase far from the resonance. 

In the resonance region ô; varies considerably with energy. If we rewrite 
formula (134.8), using the result 


gitan— A l + tr 


i 


e2ttan— A ne 
re) Ae 
in the form 


iN 
ò = 6) —tan-1—_—___, 134.1 
l l n XE Eo) ( 0) 


we see that the phase changes by = in a passage through the whole resonance 
region (from E < Eo to E > Eo). 

For E = Eo- 4T, the function (134.7) becomes R; = — (Ir lb * etk". 
If the wave function is normalized by the condition that the integral of |p|? 
over the region within the system is unity, the total current in this outgoing 
wave, equal to z|zIb;*|2, must be equal to the distintegration probability 
(134.2). Hence we find 


Jol? = 1/ħeT. (134.11) 


These results enable us to determine the amplitude of elastic scattering 
of a particle with energy E close to some quasi-discrete level Eo of the 
compound system consisting of the scattering system together with the 
particle undergoing scattering. In the general formula (123.11) we must 
substitute the expression (134.8) in the term with the value of / which corres- 
ponds to the level Eo. This gives 


nia 


i 218: P (cos 6), 134.12 
BoE HT (cos 6) ( ) 


21+1 

(QA) = 

where {(6) is the scattering amplitude far from the resonance, which is 

independent of the properties of the quasi-stationary state (it 1s given by 

formula (123.11) with 6; = 6), in each term of the sum).t The amplitude 

f(@) is called the potential scattering amplitude, and the second term 

in (134.12) the resonance scattering amplitude. The latter has a pole at 
E = Ey—iI, which, as shown above, is not on the physical sheet. | 

Formula (134.12) determines the elastic scattering near resonance at one 

of the quasi-discrete levels of the compound system. Its range of validity 





+ If scattering of a charged particle by a system of charged particles is considered, the 
expression (135.11) must be used for the phases 6;!°. . 

{It may be noted that formula (133.15) for resonance scattering of slow particles by a 
positive energy level « with /40, with E close to «, is in exact correspondence with the 
resonance tetm in (134.12). The values of E, and F are given by formulae (133.16), and since 
E is small the phase 6)‘® is small, so that eia]. 
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is defined by the requirement that the difference |E — Eo] should be small 
compared with the distance D to the adjoining quasi-discrete levels: 


|E—Eo| < D. (134.13) 


This formula is somewhat simplified if the scattering of slow particles is 
being considered, 1.e. if the wavelength of the particles in the resonance 
region is large compared with the dimensions of the scattering system. Here 
only s-wave scattering is important; we shall suppose that the level Eo 
does in fact belong to motion with / = 0. The potential scattering amplitude 
then reduces to a real constant — a (see §132).f In the resonance scattering 
amplitude we put / = 0 and replace e240 by unity, since 69 = —ak< 1. 
Thus we find 


ir 


a k(E—Eo+}il) 


(134.14) 


In a narrow range |E — Eo| ~ T the second term is large compared with the 
amplitude «, and the latter may be omitted. Farther from the resonance, 
however, the two terms may be comparable. 

In the above derivations it has been tacitly assumed that the value Ep of 
the level itself is not too small, and that the resonance region is not in the 
neighbourhood of the point E = 0. If resonance at the first quasi-discrete 
level of the compound system is considered, which lies at a distance from 
E = 0 small compared with the distance to the next level (Ep < D), the ex- 
pansion (134.6) may be no longer permissible. This is seen from the fact 
that the amplitude (134.14) does not tend to a constant limit as Æ — Q0, as 
would be necessary for s-wave scattering according to the general theory. 

Let us consider the case of a quasi-discrete level close to zero, again 
assuming that in the resonance region the particles undergoing scattering are 
so slow that only s-wave scattering is of importance. 

The expansion of the coefficients B)(E) in the wave function must now be 
made in powers of the energy E itself. The point E = 0 is a branch point of 
the functions B)(Z), and a passage round this point from the upper to the lower 
edge of the cut changes B,(E) into B;*(E). This means that the expansion is 
in powers of ,/—E, which changes sign on the above-mentioned passage. 


We write the first terms in the expansion of the function Bo(£) for real positive 
E in the form 


Bo(E) = (E-e0+ ty+/E)bo( E), (134.15) 


where eo and y are real constants, and bo(E) a function of energy, which can 
also be expanded in powers of yE but has no zero near the point FE = 0. 
The quasi-discrete level E = Ey—}iT corresponds to the vanishing of the 





t Itis assumed that the scattering field decreases sufficiently rapidly with increasing distance. 
In §145 the results given here will be applied to the scattering of slow neutrons by nuclei. 

t The function 6,(E) determines, according to (134.9), the phase of the potential scattering. 
In the scattering of slow particles, the first terms in its expansion are b(E) = constant x 


t(1 + ick). 
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factor E— eo +iyy/E, continued into the lower half-plane of the non-physical 
sheet; we therefore have for the determination of Ep and I the equation 


E ep+-ty/(Eo—iil) = 0 (134.16) 


(the constants «9 and y must be positive in order that Ep and I should be 
positive). For example, a level with width T < Eo corresponds to the rela- 
tion «9 > y? between these constants, and from (134.16) we have Eo = €p, 
i = 2y4/€0. 

The expression (134.15) replaces in this case formula (134.6); the subse- 
quent formulae must be correspondingly modified (everywhere replacing Eo 
by eo and I by 2y./E). Hence we obtain for the scattering amplitude, instead 
of (134.14), the expression 


hy 
oa mm a e 
/(2m)(E— 0+ iys/E) 


(where we have put k = 4/(2mE)/h, m being the reduced mass of the particle 
and the scattering system). For E > 0 this amplitude tends to a constant 
limit, as it should, thus confirming the form of the expansion (134.15). 

It may be noted that the expression (134.17) also covers the case of a true 
discrete level of the compound system close to zero, which is given by an 
appropriate relation between the constants eo and y. If |eo| < y*, the first 
term E may be neglected in the denominator of the resonance term for 
energies E < y*. 

Neglecting also the potential scattering amplitude «, we obtain the formula 


AE (134.17) 


1 
j= -—m 
ik— /(2m)eo| fry 
which is the same as formula (133.7) (with x = — +/(2m)eo/hy). This corres- 


ponds to resonance at the level E = eo?fy?, which is a true or virtual discrete 
level according as the constant « is positive or negative. 


§135. Rutherford’s formula 


Scattering in a Coulomb field is of interest from the point of view of 
physical applications. It is also of interest in that, for this case, the quantum- 
mechanical collision problem can be solved exactly. 

When there is a direction (in this case, the direction of incidence of the 
particle) which can be distinguished from the remainder, Schrédinger's 
equation in the Coulomb field is conveniently solved in parabolic coordi- 
nates £, 7,¢ (§37). The problem of the scattering of a particle in a central 
field is axially symmetric. Hence the wave function W is independent of the 
angle ¢. We write the particular solution of Schrédinger’s equation (37.6) 
in the form 


Y = AKE) feln); (135.1) 
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this is (37.7) with m = 0. Accordingly, after separating the variables, we 
obtain} equations (37.8) with m = 0: 


ds ah 
sal fae) TOME ADA Si 


(135.2) 
d / df 


E A +(łk?n—P2)f2 = 0, Bitfe = 1. 
n\ dy 


The energy of the particle scattered is, of course, positive; we have put 
E = $k*. The signs in equations (135.2) are for the case of a repulsive field; 
exactly the same final result is obtained for the scattering cross-section in an 
attractive field. 

We have to find that solution of Schrédinger’s equation which, for nega- 
tive z and large r, has the form of a plane wave: 


p ~ ek? for—=-œ <z <0, r> œ, 


corresponding to a particle incident in the positive direction of the z-axis. 
We shall see from what follows that the condition imposed can be satisfied 
by a single particular integral (135.1); a sum of integrals with various values of 
Bi, B2 is not needed. 

In parabolic coordinates, this condition takes the form 


y ~ ebklE-m/2 fory -> œ and all &. 
This can be satisfied only if 
filg) = es? (135.3) 
and f,(7) is subject to the condition 


faln) ~ etka for n —> œ. (1335.4) 


Substituting (135.3) in the first of equations (135.2), we see that this 
function does in fact satisfy the equation, provided that the constant 8, = }zk. 
The second equation (135.2), with Bz = 1~ 3, then takes the form 


d dfs 
= Tae) +MY 1+ HI fi =f 


Let us seek its solution in the form 
Jelan) = enwn), (135.5) 


where the function w(n) tends to a constant as 7 > oo. For w(7) we have 
the equation 
qw” +(1—ikn)w’'—w = 0, (135.6) 


t In this section we use Coulomb units (see §36). 
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which, by introducing the new variable y = ikņ, can be reduced to the 
equation for a confluent hypergeometric function with parameters « = —7/R, 
y = 1. We have to choose that solution of equation (135.6) which, on being 


multiplied by /,(¢), contains only an outgoing (i.e. scattered) and not an 
ingoing spherical wave. This solution is the function 


w = constant x F(—:/k, 1, tkn). 


Thus, on assembling the expressions obtained, we find the following exact 
solution of Schrédinger’s equation, describing the scattering: 


Y = e- Ik (1+ ijk}jetkt-0/2F(—ijk, 1, ikn). (135:4) 


We have chosen the normalization constant in & such that the incident plane 
wave has unit amplitude (see below). 

In order to separate the incident and scattered waves in this function, we 
must consider its form at large distances from the scattering centre. Using 
the first two terms of the asymptotic expansion {formula (d.14)) for the con- 
fluent hypergeometric function, we have for large n 


(—ikn)*/* 1 (ikn)-t/* etka 


F(—i/k, 1, ik) x ———(14+—— )+—_—- —— 
(TAR BP) EEN n / TTR) i 





en [2K 1 (i/k)en/2k etkn 

ee ee See Ja Ik) logik n) e Mik) logik), 
T+i/k)\ | ikBy T(1—i/k) ikn 

Substituting this in (135.7) and changing to spherical polar coordinates 

(E-n = 22,9 = r—z = r(1— cos 6)), we have the following final asymptotic 

expression for the wave function: 








l f(8) 
y = E ae | pikz+(t /k) log(kr—kr cos 6) 4. etkr Mt /k) log2kr) (135.8) 
ikèr(1 —cos ê) r 
where 
1+2/k 
KO) Dn —e~21/k) log sınĝ/2 ae (135.9) 
2k? sin2h6 r(1— ijk) 


The first term in (135.8) represents the incident wave. We see that, in 
consequence of the slow decrease of the Coulomb field, the plane wave 1s 
distorted even at large distances from the centre, as is shown by the presence 
of the logarithmic term in the phase and of the 1/7 term in the amplitude. 
The distorting logarithmic term in the phase is found also 1n the scattered 





+ The origin of this distortion may be elucidated classically. If we consider a family of 
classical Coulomb hyperbolic paths with the same direction of incidence (parallel to the 
z-axis), the equation of the surface normal to them at large distances from the scattering centre 
(z ~— 0) is easily shown to tend to s+k7* log k(y —2) = constant, not 3 = constant. This 
is the surface of constant phase of the incident wave in (135.8). 
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spherical wave given by the second term in (135.8). These differences from 
the usual asymptotic form of the wave function (123.3) are unimportant, 
however, since they give a correction to the current density which tends 
to zero as 7 > OO. 

Thus we obtain for the scattering cross-section do = / f(6)|? do the 
formula 


do = do/4k4 sin4}é, 
or, in ordinary units, 


do = (a/2mv?)2 do/sin*}6, (135.10) 


where the velocity v of the particle = k/m. This is the familiar Rutherford’s 
formula given by classical mechanics. Thus, for scattering in a Coulomb field, 
quantum and classical mechanics give the same result (N. Mott, and W. 
Gordon, 1928). Born’s formula (126.12) naturally leads to the same 
expression (135.10) also. 

We shall give for reference the expression for the scattering amplitude 
(135.9), written as a sum of spherical harmonics. This is obtained by 
substituting in (124.5) the phases from (36.28), i.e. 


exp(2i51, com) = (I+ 1+i/k)/T(+1 --z/k). (135.11) 
Thus we find 
1 T+ 1+i/k) 
K= z 2 (Dim TH) (135.12) 


The signs in the scattering amplitude (135.9) correspond to a repulsive 
field. In an attractive Coulomb field, formula (135.9) is replaced by the 
complex conjugate expression. f(6) then becomes infinite at the poles of the 
function I'(1 —:/k), i.e. at points where the argument of the gamma function 
is a negative integer or zero (when im k> 0 and the function ry decreases at 
infinity). The corresponding energy values are $k2 = —1/2n? (n = 1, 2, 3, 
..-), and coincide with the discrete energy levels in the Coulomb field 


(cf. §128). 


§136. The system of wave functions of the continuous spectrum 


In the analysis of motion in a centrally symmetric field (Chapter V) we 
have considered stationary states in which the particle has definite values of 
the energy, the orbital angular momentum J, and the component m of this 
angular momentum. The wave functions of such states of the discrete 
spectrum (%ntm) and the continuous spectrum (Wkim, energy h2k2/2m) 
together form a complete set in terms of which the wave function of any state 


t The value of Coui 1n this formula differs from the true (divergent) Coulomb phase by a 
Quantity which is the same for all Z. 
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may be expanded. Such a set of functions is, however, not appropriate for 
problems in scattering theory. Here another set is convenient, in which the 
wave functions of the continuous spectrum are described by a particular 
asymptotic behaviour: at infinity there is a plane wave and an outgoing 
spherical wave. In these states the particle has a definite energy, but no 
definite angular momentum magnitude or component. 

According to (123.6) and (123.7), such wave functions, here denoted by 
Y, are given by 


1 © | | 
fet: (2 r 
: i 14 et; Rei(r)Pi(k - rkr). (136.1) 


The argument of the Legendre polynomiais is written as cos 6 = k. rikr, 
and the expression therefore does not involve any particular choice of the 
coordinate axes as it did in (123.6) (where the z-axis was the direction of 
propagation of the plane wave). By giving the vector k all possible values, 
we obtain a set of wave functions, which, as we shall now show, are ortho- 
gonal and normalized by the usual rule for the continuous spectrum: 


f PP dV = (27)8 &(k' —k). (136.2) 


To prove this,t we note that the product wit * gh is expressed by a double 
sum over / and l’ of terms containing the products 


Pi(k . r/kr)Pi(k' . r/k'r). 


The integration over the directions of r 1s effected by means of the formula 


| Pulk. x/kr)Pi(k’« rlk'r)do = bur aa Pilk kkk’); (136.3) 


cf. (c.12) in the Mathematical Appendices. This leaves 
Los) Le 9) 
| AMPH) dV = a > (214 1)etlesd-24k NP (cos y) | Reil(r)Rulr)?? dr, 
10 0 


where y is the angle between k and k’. The radial functions Rx: are ortho- 
gonal, however, and are normalized by 


oc 


Í RyRy r2 dr = 278(k' —k). 
0 


i 


+ Essentially, only the orthogonality of the phe ) needs to be proved separately, the nor- 
malization could be derived directly from the asymptotic form of the functions (cf. §21). In 
this sense, the validity of (1 36.2) is evident from the fact that, as r > &, the only non-decreasing 


term in these functions is ph ae ether, 
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Hence we can put k = R’ in the coefficients in front of the integrals; using 
also the relation (124.3), we have 


ne, a NA 
| uan dV = reals —R) > (21+ 1)Pi(cos y) 
[=0 
872 


i 


TR — k)ô( 1 — cos y). 


The expression on the right is zero for k # k’; on being multiplied by 
2rk? sin y dk dy/(27)% and integrated over all k-space it gives 1, and this 
proves formula (136.2). 

Together with the system of functions y+’, we can also introduce a system 
corresponding to states in which there are at infinity a plane wave and an 
ingoing spherical wave. These functions, which we denote by ¢{7), are 
obtained directly from the p{*?: 


= + 
Yi) = RP hak (136.4) 
since the complex conjugate of e?*"/r (outgoing wave) is e~‘47/r (ingoing 
wave), and the plane wave becomes e~™', so that, in order to retain the 


previous definition of k (plane wave e™"), we must replace k by —k, as in 
(136.4). Noticing that P;(—cos 8) = ( — 1}!P (cos 8), we obtain from (136.1) 


sere = > #214 1)eRe(e)Pi(k-/hr). (136.5) 
j= 


The case of a Coulomb field is of great importance. Here the functions 
i." (and yi’) can be written in a closed form, which is obtained directly 
from formula (135.7). We express the parabolic coordinates by 


$h(E—n) =kz =k.r, kn =kR(r—z) = kr—k.r. 


Thus we obtain for a repulsive Coulomb field+ 


be? =e /2kD(1+4i/k)etk-rF(—ij/k, 1, ikr—ik .r), (136.6) 
(—) = e~7 2k (1 —ijk)jetk.tF(ilk, 1, —ikr—ik .r). (136.7) 


t Using Coulomb units. 
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The wave functions for an attractive Coulomb field are found by simul- 
taneously changing the signs of k and r: 


YL) = er PET (1—i/k)e'k -F (jk, 1, ikr—ik r), ee) 
bi = et I2KT(1+ijkjett-"F(—tjR, i; —ikr—ik.r). (136.9) 


The action of the Coulomb field on the motion of the particle near the 
origin may be characterized by the ratio of the squared modulus of Y{*? or 
Jm at the point r = 0 to the squared modulus of the wave function ýy = 
e*" for free motion. Using the formula 


(1 + ik) PQ. — ijk) = (ijk) TGR) TL — ilk) 


= mk sinh(7jk), 
we easily find, for a repulsive field, 
EPO? — WOE N 
TAK ul? permian) 
and for an attractive field, 
(+ Q)12 (-X0)}2 Jr 
a - Alet) itt, 


The functions (+) and Y{~ play an important part in problems relating to 
the application of perturbation theory in the continuous spectrum. Let us 
suppose that, as a result of some perturbation VY, the particle makes a tran- 
sition between states of the continuous spectrum. The transition probability 
is determined by the matrix element 


f ust Py AV. (136.12) 


The question arises of which solutions of the wave equation are to be taken 
as the initial (y4) and final (#y) wave functions, in order to obtain the ampli- 
tude for a transition of the particle from a state with momentum hk to one with 
momentum Ak’ at infinity.+ We shall show that this requires that 


be = HO, be = AP azeda 
(A. Sommerfeld 1931). 





t An example of such a process is an electron colliding with a heavy nucleus at rest and 
emitting a photon, thereby changing its energy and Its direction of motion; the perturbation V 
is the interaction between the electron and the radiation field, and the Coulomb field of the 


nucleus is the field U for which the functions wit and W, | are defined (see RQ T, §§92 and 96). 
Another example is a collision of an electron with an atom, accompanied by ionization of the latter; 
see §148, Problem 4. 
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This becomes clear if we consider how the problem would be solved by 
perturbation theory applied not only as regards the perturbation V but also 
as regards the field U(r) in which the particle is moving. In the zero-order 
approximation (with respect to Ù), the matrix element (136.12) is 


Vig ee av 


In subsequent approximations with respect to U, this integral is replaced by 
a series of which each term is an integral 


Viet, Gia, ses U knk l d3kı o d3k,,: 

(E, — Ey, +10) ... (By ~ Ey, +10) 

cf. §§43 and 130. The numerator contains the matrix elements (in varying 
order) with respect to the unperturbed plane waves, and all poles are avoided 
in the integrations, according to one fixed rule. On the other hand, this 
series can be obtained as the matrix element (136.12) with the wave functions 
yy and ẹp as perturbation-theory series with respect to the field U. The fact 
that the result must be a sum of integrals in which all poles are avoided by the 
same rule means, therefore, that the poles in the terms of the series represen- 
ting y; and y-* must be avoided by a similar rule. But if the wave equation 
is solved by perturbation theory with this avoidance rule, we necessarily 
obtain a solution whose asymptotic form includes an outgoing (as well as a 
plane) wave. In other words, the wave functions, which in the zero-order 
approximation (with respect to U) have the form 


bri ss ge. us * = oa =, 


must be replaced by exact solutions of the wave equation, respectively yt? 
and {> + (¢{-)*. This proves the rule (136.13). 

The choice of ys) as the final wave function applies also to transitions 
from the discrete to the continuous spectrum; here there is, of course, no 
problem of choosing y4. 


§137. Collisions of like particles 


_ The case where two identical particles collide requires special considera- 
tion. The identity of the particles leads in quantum mechanics to the 
appearance of a peculiar exchange interaction between them. This has an 
important effect on scattering also (N. F. Mott 1930).+ 

The orbital wave function of a system of two particles must be symmetric 
Or antisymmetric with respect to the particles, according as their total spin 
1s even or odd (see §62). The wave function which describes the scattering, 
and which is obtained by solving the usual Schrédinger’s equation, must 
therefore be symmetrized or antisymmetrized with respect to the particles. 
An interchange of the particles is equivalent to reversing the direction of the 





t Here the direct spin-orbit interaction is again ignored. 
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radius vector joining them. In the coordinate system in which the centre 
of mass is at rest, this means that r remains unchanged, while the angle 6 
is replaced by 7—6 (and so z = r cos 6 becomes —z). Hence, instead of the 
asymptotic expression (123.3) for the wave function, we must write 


yb = ekz 4 eth. etkr[f(0)+ f(n—8)]/r. (137.1) 


By virtue of the identity of the particles it is, of course, impossible to say 
which of them scatters and which is scattered. In the coordinate system 
in which the centre of mass is at rest, we have two equal incident plane waves, 
propagated in opposite directions (e'*2 and e-'*2). The outgoing spherical 
wave in (137.1) takes into account the scattering of both particles, and the 
probability current calculated from it gives the probability that either of the 
particles will bescattered into the element do of solid angle considered. The 
scattering Cross-section is the ratio of this current to the current density in 
either of the incident plane waves, 1.e. it 1s given, as before, by the squared 
modulus of the coefficient of e**"/r in the wave function (137.1). 

Thus, if the total spin of the colliding particles is even, the scattering 
cross-section 1s of the form 


do, = |f(0)+f(z—8)|? do, (137.2) 
while if the total spin is odd, it is 
dog = | f(@)—f(7z—8)|? do. (137.3) 


The appearance of the interference term f (6) f* (7— 6)+f *(@) f (7—0) 
characterizes the exchange interaction. If the particles were different, as 
they are in classical mechanics, the probability that either of them would be 
scattered into a given element of solid angle do would simply be equal to the 
sum of the probabilities that one particle is deviated through an angle @ and 
the other through 7-6; in other words, the cross-section would be 


UOH (#—9)|?} do. 


In the limiting case of low velocities, the scattering amplitude tends to a 
constant value independent of the angle (§132) if the interaction of the 
particles decreases sufficiently rapidly with increasing distance. It is seen 
from (137.3) that dog is then zero, i.e. only particles with even total spin 
scatter each other. 

In formulae (137.2), (137.3) it is supposed that the total spin of the col- 
liding particles has a definite value. If the particles are not in definite spin 
states, then to determine the cross-section it is necessary to average, assuming 
all possible spin states to be all equally probable. We have shown in §62 that, 
of the total number of (2s + 1)2 different spin states of a system of two particles 
with spin ş, s(2s+ 1) states correspond to an even total spin and (s + 1)(2s + 1) 
to an odd total spin (if s is half-integral), or vice versa if s is integral. Let us 
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first suppose that the spin s of the particles is half-integral. Then the 
probability that the system of two colliding particles will have even S is 
s(2s + 1)/(2s +1)? = s/(2s+1), while the probability of odd Sis (s+ 1),(2s +1). 
Hence the cross-section is 


S$ § 
TTEA A 137.4 
2a aa eae 





dor 
Substituting here (137.2), (137.3), we obtain 


1 
LOS") *(8) f(a—0)]} do. 


gs 2 a O\/2— 
deo = {| f()P+|f(7—8) ES (137.5) 





Similarly, we find for integral s 


1 
2s+1 





do = {f()P+|f(7—6)|2-+——1 f (0) f*(a—6) + f *(8) f(m—6) ]} do. 


(137.6) 


As an example, we shall write out the formulae for the collision of two 
electrons interacting by Coulomb's law (U = e2/r). Substitution of the 
expression (135.9) in the formula (137.5) with s = } gives (in ordinary units), 
after a simple calculation, 


, e \2 1 1 1 (- 40) | P 
= —— + ——— —-——____cos[ — log tan : 
A (=) E 3 cos#4@ sin®36 cos?$0 hv 5 . 
(137.7) 


where we have introduced the mass m, of the electron in place of the reduced 
mass m= mọ. This formula is considerably simplified if the velocity is 
so large that e? < vh; we notice that this is just the condition for perturba- 
tion theory to be applicable to a Coulomb field. Then the cosine in the third 
term can be replaced by unity, and we have 








2e2 \2 4—3 sin26 
do = ( ) —— do. (137.8) 
mgou? sin4ĝ 


The opposite limiting case, e*>vh, corresponds to the passage to the limit 
of classical mechanics (see the end of §127). In formula (137.7) this transition 
occurs in a very curious way. For e? > vh, the cosine in the third term in the 
Square brackets is a rapidly oscillating function. For any given 6, formula 
(137.7) gives for the scattering cross-section a value which in general differs 
considerably from the Rutherford value. However, on averaging over even 
a small range of values of 9, the oscillating term in (137.7) vanishes, and we 
obtain the classical formula. 

All the above formulae for the cross-section refer to a system of co- 
ordinates in which the centre of mass is at rest. The transition to a system 
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in which one of the particles is at rest before the collision is effected (according 


to (123.2)) simply by replacing 6 by 28. Thus, for a collision of electrons 
we have from (137.7) 


2e2 1 e2 
do = ( y [= 1e L o( log rant) | cos 3 do, 
mov? RETIA cost? sin? eT hu 


(13757) 
where do is the element of solid angle in the new system of coordinates. 


In replacing @ by 25, the element of solid angle do must be replaced by 
4 cos $ do, since sin 6 déd¢ = 4 cos $ sin $dSd¢. 





PROBLEM 
Determine the scattering cross-section for two identical particles of spin $, with given mean 
à = L 
spin values §, and S}. 


SOLUTION. The dependence of the cross-section on the polarizations of the particles must 
be expressed by a term proportional to the scalar §,.5,. We look for do in the form a + b5,.5,. 
For unpolarized particles (5, = §, = 0), the second term is absent, and according to (137.4) 
do = a = }(do;+3do,). If both particles are completely polarized in the same direction 
(§,.5, = }),; the system is certainly in a state with S = 1; in this case, therefore, do = a+jb = 
deg. With a and b determined from these two equations, we have 


do = }(dos+ 3doq) + (dog — dos)S: « S2. 


§138. Resonance scattering of charged particles 


In the scattering of charged nuclear particles (e.g. of protons by protons), 
as well as the short-range nuclear forces there is the Coulomb interaction, 
which decreases only slowly. The theory of resonance scattering in this case 
is developed by the same method as that described in §133. The only differ- 
ence is that the wave function in the region outside the range of action of the 
nuclear forces (r > a) must be, instead of the solution of the equation of free 
motion (133.2), the exact general solution of Schrodinger’s equation in a 
Coulomb field. Here the velocity of the particles is again assumed only so 
ans that ka < 1; the relation between 1/k and the Coulomb unit of length 

= h?|/mZ Z2? CARERE m is the reduced mass of the colliding particles) is 
left arbitrary.t 

For motion with / = 0 in a repulsive Coulomb field, Schrédinger’s 
equation for the radial function y = rRo is 


2 
x+ (HH =x o (138.1) 


here we use Coulomb units. In §36 the solution of this equation has been 
found, subject to the requirement that x/r is finite at r = 0. This solution, 


t The theory given below is due to L. D. Landau and Ya. A. Smorodinskil (1944). 
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which we here denote by Fo, has the form (see (36.27) and (36.28)) 


Fo = AettrkrF(i/k+1, 2, —2ikr), 


ea ZP (138.2) 
e?nik—] 
The asymptotic expression for this function at large distances is 


1 
Fo ~ sin( hr z 1082r +50,cou), 


(138.3) 
Ôo, Cou1 = arg [ (1+ t/k), 


and the leading terms of the expansion for small r (kr & 1,r <1) are 


Fo = Akr(1+r+- ...). (138.4) 


Now, however, with the changed boundary condition, the behaviour of the 
function at the origin becomes unimportant, and we need the general solution 
of equation (138.1), which is a linear combination of two independent 
integrals. 

The parameters of the confluent hypergeometric function in (138.2) are 
such (the value of y = 2 being integral) that the case described at the end 
of §d of the Mathematical Appendices occurs. In accordance with the 
discussion given there, we obtain the second integral of equation (138.1) 
by replacing the function F in (138.2) by some other linear combination of 
two terms whose sum is, according to (d.14), the confluent hypergeometric 
function. Taking the difference of these terms as the combination in question, 


we find the second independent solution of equation (138. 1) (denoted by Go) 
in the form} 


_ Ae-tkrky ; 
Go = oT ey GUL — ih, —i/k, —21kr); (138.5) 


the function Fo is the real part of the same expression. The asymptotic form 
at large distances is 


1 
Go ~ cos( r } log hr +50,cout), (138.6) 


and the leading terms of the expansion for small r are 


a 


ones 
wT 


(1 + 2r[log2r +2C—1+A(k)] + ...}, (138.7) 





t The functions F, and Gao {and the correspondingly defined functions F; and G with 130) 
are called regular and irregular Coulomb functions respectively. 
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where C = 0-577... is Euler’s constant, and A(k) denotes the function 
h(k) = reys(—1/k) + logk, (138.8) 


p(z) = IY(z)/P'(z) being the logarithmic derivative of the T function.t+ 
The general integral of equation (138.1) mav be written as the sum 


x = constant x (Fo cotôo + Go), (138.9) 


where cot ĝo is a constant. The notation is chosen so that the asymptotic 
form of this solution is 


1 
x~ sin( zr- A log2kr+80,com +80). (138.10) 


Thus ôo is the additional phase shift of the wave function due to the short- 
range forces. We have to relate it to the constant appearing in the boundary 


A(x) 





+ The expansion (138.7) is obtained from (138.5) by means of the expansion (d.17), using 
the well-known relation 


p1 +2) = oi(z)+1/z 
(which is easily derived from (z+1) = 2I(z)) and the values ¥(1) = —C, y2) = —C+1. 
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condition [x'/x]r.0 = constant, which replaces the treatment of the wave 
function in the region where nuclear forces act. Owing to the logarithmic 
divergence of the logarithmic derivative y‘/y as r > 0, this condition must 
be applied at some arbitrarily small but finite value r = p, not at r = 0. 
Calculating by means of formulae (138.4) and (138.7) the derivative y’( p)/x(p) 
and equating it to a constant, we obtain the boundary condition in the form 


kA? cot o+ 2[log 2p +2C + A(k)] = constant. 


The expression on the left-hand side of the equation contains the constants 
2 log 2p and 4C, which are independent of k; we include these in the constant 
on the right, and then denote it by —x. The final expression for cot So is, 
in ordinary units, 


1 
cotôo = ——{e2"/kac— 1)[A(kae) +4402); (138.11) 
T 


in the limit 1/ae >0, i.e. for uncharged particles, formula (138.11) becomes 
the relation cot 69 = — «fk, i.e. (133.6). 

Figure 49 shows a graph of the function h(x).+ 

Thus, when there is a Coulomb interaction, the “constant” is 


2n cot do 2 
— ~ + —h(ka,) = —k. (138.12) 
ac(e?7kae—1) ae 


We have put the word “constant” in quotation marks, since x is actually the 
first term in an expansion in powers of the small quantity ka of some function 
which depends on the properties of the short-range forces. As stated in §133, 
resonance at low energies corresponds to the case where the value of the con- 
stant x is anomalously small. Consequently, in order to improve the accuracy, 
we must take account also of the next term (~k?) in the expansion, which 





t To calculate the function A(k), we can use the formula 


co 


h(k) = k-2 
(k) = 2 aan Cte 


naj 
which is easily obtained by means of the formula 


l © 
2) = —C~-+3> 


n=l 


l e 
n(n+ 2)’ 





see Whittaker and Watson, Course of Modern Analysis, Cambridge, 1944, §12.16. The 
limiting expressions for h(k) are 


h(k) = k?j12 for” sk €1, 
h(k) = —C+logk+1-2/k? for k > 1; 


the latter formula gives values of h(k) which are correct to within 4% even for k> 2-5. 
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contains a coefficient of “normal” magnitude, i.e. in (138.12) — x must be 
replaced by — xo + $rok®.t 

The existence of resonance may, as stated in §133, be due to either a true 
or a virtual discrete bound state of the svstem. It can be shown{ that the sign 
of the constant « is again the criterion which determines whether the level is 
true or virtual. 

The total phase shifts of the wave functions are, according to (138.10), the 
sums 8) co +ô: The scattering cross-section is therefore 


(s 6) 


PORS =). (21 + 1)[e2i rcot- — 1]P,(cos 8). (138.13) 


l=0 


The difference in the brackets may be written 


e2t (61 Coul +5) —] = [ e2%61,Cou! oe 1] 


+ [e2%41,coul(e2t: — 1)], (138.14) 


The Coulomb phases 8, cou contribute equally, in order of magnitude, to the 
scattering amplitude for all 7. The phases ô; relating to the short-range 
forces are small for #0 at low energies. Hence, in substituting (138.14) in 
(138.13), we retain the first bracket in every term of the sum; the sum of 
these terms is the Coulomb scattering amplitude (135.9) 


1 21 i ! 
fcoul f) = -Ak sin® 46 exp E log sin 10 + 2i8 cou) (138.15) 


Ac 


The second bracket in (138.14) is retained only in the term with 7 = 0. Thus 
the total scattering amplitude 1s 


F (8) = fool 8) +55 (62. — eoon. (138.16) 
ae 


The second term in this expression may be called the nuclear scattering 
amplitude. It should be emphasized, however, that the division is arbitrary: 
in view of the definition of ôo in (138.11), the presence of the Coulomb inter- 
action has a considerable effect on this term also, which is quite different 
from the corresponding term with the same short-range forces for uncharged 
particles. In particular, when ka, >0 the phase So, and therefore the whole of 





+ The values of the constants « = 1/xo and r, for proton-proton scattering are « = —78*x 
10-33 cm, rọ = 2°8x10- cm (Coulomb unit of length 2/?/mpe? = 57-6 X 10-13 cm). These 
values relate to a pair of protons with antiparallel spins; when the spins are parallel a system of 
two protons cannot be in the s state, by Pauli’s principle. ee x: 

t See L. Landau and Ya. A. Smorodinski, Zhurnal éksperimental noi i teoreticheskot fiziki 14, 
269, 1944. 
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the second term in (138.16), tend exponentially to zero as e~2"/*@c, 1.e. the 
nuclear scattering is entirely masked by the Coulomb repulsion. 
In the scattering cross-section the two parts of the amplitude interfere: 


= ABF = (==) 








2mv2 sint46 
Thar sind cos( i log sin$0 +ô ) +4¢ka )? sin2d i 138.17 
2 ie c)? si (138. 
sin?}6 0 Pi, £ 0 o |-( 7) 


Here it is assumed that the colliding particles are different; for like particles, 
the scattering amplitude must be symmetrized before being squared (cf. 


§137). 


§139. Elastic collisions between fast electrons and atoms 


Elastic collisions between fast electrons and atoms can be treated by means 
of the Born approximation if the velocity of the incident electron is large 
compared with those of the atomic electrons. 

Owing to the large difference in mass between the electron and the atom, 
the latter may be regarded as at rest during the collision, and the system of 
coordinates in which the centre of mass is fixed is the same as that in which 
the atom is fixed. Then p and p’ in formula (126.7) denote the momenta of 
the electron before and after the collision, m the mass of the electron, and 
the angle 6 is the same as the angle of deviation $ of the electron. The poten- 
tial energy U(r) in formula (126.7) must be defined appropriately. 

In §126 we have calculated the matrix element Upp of the interaction 
energy with respect to the wave functions of a free particle before and after 
the collision. In a collision with an atom it is necessary to take into account 
also the wave functions describing the internal state of the atom. In an elastic 
collision, the state of the atom is left unchanged. Hence Ug must be deter- 
mined as the matrix element with respect to the wave functions p, and 4, of 
the electron; it is diagonal with respect to the wave function of the atom. 
In other words, U(r) in formula (126.7) must be taken to be the potential 
energy of the interaction of the electron with the atom, averaged with respect 
to the wave function of the latter. It is ef(r), where ¢(r) is the potential of 
the field at the point r due to the mean distribution of charges in the atom. 


Denoting the density of the charge distribution in the atom by p(r), we 
have, for the potential ¢, Poisson’s equation: 


Ad = —4np(r). 


The required matrix element U „is essentially the Fourier component of U 
(i.e. of $) corresponding to the wave vector q=k’—k. Applying Poisson’s 
equation to each Fourier component separately, we have 


Alpat- t) = — ghett = — Arpat", 
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so that 


dq = 4pq/q", 


l.e. 


f deta-r ay = (41/92) f pear ay. (139.1) 


The charge density p(r) consists of the electron charges and the charge 
on the nucleus: 


p = —en(r)+ Ze8(r), 


where en(r) is the electron charge density in the atom. Multiplying by 
er and integrating, we have 


f pete aV = —e | neta. dV + Ze, 
Thus we obtain for the integral in question the expression 
4ne* 
f Ue-a-r dV = —{[Z—F(9)], (139.2) 
g 
where F(q) is defined by the formula 
F(q) = f neta-r dV (139.3) 


and is called the atomic form factor. It is a function of the scattering angle and 
of the velocity of the incident electron. 

Finally, substituting (139.2) in (126.7), we obtain the following expression 
for the cross-section for the elastic scattering of fast electrons by an atom}: 


2e4 
i BS rays, gp = Sams: (139.4) 
hig h 





Let us consider the limiting case qao < 1, where ao is of the order of 
magnitude of the dimensions of the atom. Small scattering angles correspond 
to small q: $ < v/v, where vo~ħ/mao is of the order of magnitude of the 
velocities of the atomic electrons. 

Let us expand F(q) as a series of powers of g. The zero-order term is 
f ndV, which is the total number Z of electrons in the atom. The first- 





+ We are neglecting exchange effects between the fast electron which undergoes scattering 
and the atomic electrons, i.e. we do not symmetrize the wave function of the system. The 
legitimacy of this procedure is evident: the interference between the rapidly oscillating wave 
function of the free particle and the wave function of the atomic electrons in the “exchange 
integral” has the result that the corresponding contribution to the scattering amplitude 1s 
small. 
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order term is proportional to f rn(r) dV, i.e. to the mean value of the dipole 
moment of the atom; this vanishes identically (see §75). We must therefore 
continue the expansion up to the second-order term, obtaining 


1 
— = 2 2 s 
Z—F(a) = -4 Í nr? dV; 


substituting in (139.4), we obtain 


2 
do. (139.5) 





do = 


me? 
3 he | nr? dy 





Thus, in the range of small angles, the cross-section is independent of the 
scattering angle, and is given by the mean square distance of the atomic 
electrons from the nucleus. 

In the opposite limiting case of large q (qa > 1, 1-e.3 > vav), the factor 
ear in the integrand in (139.3) is a rapidly oscillating function, and there- 
fore the whole integral iè nearly zero. Consequently, we can neglect F(q) 
in comparison with Z, so that 


Ze N? do 
"R ( ) kad (139.6) 
2mv*/ sintt3 


i.e., we have Rutherford scattering at the nucleus of the atom. 
We may also calculate the transport cross-section 








or = { (1—cos 8)de. (139.7) 


In the range of angles $ <vo/v we have, according to (139.5), da = constant x 
x sin $ d$ = constant x $d, where the constant is independent of 9. 
Hence, in this region, the integrand in the above integral is proportional 
to 3, so that the integral converges rapidly at the lower limit. In the region 
1 > 3 > vo/v we have do = constant x d9$/93; the integrand is proportional 
to 1/9, and the integral (139.7) diverges logarithmically. Hence we see that 
this range of angles plays the chief part in the integral, and we need integrate 
only over this range. The lower limit of integration must be taken as of the 
order of v/v; we shall write it in the form e#/yhv, where y is a dimensionless 
constant. As a result we have the formula 


Str = 4n( Ze2/mv2)? log(yhv/e?). (139.8) 


An exact calculation of the constant y requires a consideration of scattering 
through angles $ > vo/v, and cannot be carried out in a general form; ctr 
depends only slightly on the choice of this constant, since it enters only in 
a logarithm, and multiplied by the large quantity Av/e?. 

For a numerical calculation of the atomic form factor 


ead for heavy atoms, we 
can use the Thomas—Fermi distribution of the density \ - 


n(r). We have seen 
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that, in the Thomas—Fermi model, n(r) has the form 


n(r) = Z2f(rZ1/3)b); 


all quantities in this and the following formulae are measured in atomic units. 
It is easy to see that the integral (139.3), when calculated with such a function 
n(r), will contain q only in the combination gZ~¥/:; 


F(q) = Z(bgZ-2'8). (139.9) 


Table 11 gives, for reference, the values of the function ¢(x), which holds 
for all atoms.t 


TABLE 11 


The atomic form factor on the Thomas—Fermi model 





With the atomic form factor (139.9), the cross-section (139.4) will have the 
form 


do = (4Z2/q4)[1 —d(bgZ-1/8) 2 do = Z2/8(Z-1/3y sin}9) do, (139.10) 


where (x) is a new function holding for all atoms. The total cross-section 
may be obtained by integration. The chief part in the integral is played by the 
range of small $. Hence we can write 


do ~ Z230(Z-1/8v9/2)2n9 d9, 


and extend the integration over $ to infinity: 


és m 
o = 2nZ?/3 [O(Z-1/8y9/2)9 d9 = (8/02)Z4/5 f x(x) de. 
0 0 
Thus co is of the form 
o = constant x 24/3 /v, (139.11) 


Similarly, it is easy to see that the constant y in formula (139.8) will be 
proportional to Z—1/%, 


+ It must be borne in mind that this formula ts not applicable for small q, since the integral 
of nr? cannot in practice be calculated by the Thomas—Fermi method (see the third note to 
§113). It should also be mentioned that the Thomas—Fermi model does not represent the 
individual properties of atoms or their systematic variation with atomic number. 
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PROBLEM 
Calculate the cross-section for the elastic scattering of fast electrons by a hydrogen atom 
in the ground state. 
SOLUTION. The wave function of the norma! state of the hydrogen atom is (in atomic 


=_ 2 


units) %4 = m-'¥%e-", so that n = e-*/m. The integration over angles in (139.3) is effected 
as in the derivation of formula (126.12); we have 


Pasa 
F = — Í n(r) singr.r dr = 
To 


i 
(+g) 
Substituting in (139.4), we obtain 
4(8+¢?)? 
o = ——— do 
(4+9?) 
where g = 2v sin $0. The total cross-section is calculated by putting do = 27 sinĵ dŷ = 


(27/v7)q dq and integrating over q from 0 to 2v; since v is assumed large and the integral 
converges, the upper limit may be replaced by infinity. The result is 


a = 7n/3v2. 


The transport cross-section is calculated as 


Changing the variable of integration by putting u = 4+ q?, and taking the upper limit as 
infinity everywhere except in the term du/u, we obtain 


4r 
Or = E (log vt af ? 
Y 


in accordance with (139.8). 


§140. Scattering with spin-orbit interaction 


Hitherto we have considered only collisions of particles whose interaction 
does not depend on their spins. Under these conditions the spins either do 
not affect the scattering process at all, or have an indirect influence due to 
exchange effects (§137). 

Let us now examine the generalization of the theory of scattering given in 
§123 to the case where the interaction of the particles depends significantly 
on their spins, as occurs in collisions of nuclear particles. 

We shall discuss in detail the simplest case, where one of the colliding 
particles (for definiteness taken to be the particle in the incident beam) has 
spin $, and the other (the target particle) has spin zero. 

For a given (half-integral) angular momentum j of the system, the orbital 
angular momentum can have only the two values Z = j +4, corresponding 
to states of different parities. In this case, therefore, the conservation of the 


absolute magnitude of the orbital angular momentum follows from that of j 
and the parity. . 


The operator f (§125) now acts not only on the orbital variables but also 
on the spin variables of the wave function of the system. It must commute 
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with the operator of the conserved quantity 12. The most general form of such 
an operator is 


f=a+6l.s, (140.1) 


where å and ô are orbital operators depending only on 12. 

The S-matrix, and therefore the matrix of the operator f, are diagonal with 
respect to the wave functions of states with definite values of the conserved 
quantities / and (and the component m of the total angular momentum), and 
the diagonal elements are expressed in terms of the phases & of the wave 
functions by formula (123.15). For given land given total angular momentum 
j = 1+4 or l—ẹ the eigenvalues of 1. s are $l and — $(/+ 1) respectively (see 
(118.5)). Hence, to determine the diagonal matrix elements of the operators 
â and b (denoted by a; and b;), we have the relations 


1 
ayt4dlb; = rae 
(140.2) 
ay—4(l+ 1h = (ee —1), 
i 


where the phases ô+ and ôm correspond to states withy = /+4 and; = /—4 
respectively. 

We are interested, however, not in the diagonal elements themselves of 
the operator f with respect to the states with given / and j, but in the scatter- 
ing amplitude as a function of the directions of the incident and scattered 
waves. This amplitude is still an operator, but only with respect to the spin 
variables—an operator which is non-diagonal with respect to the spin com- 
ponent o. In the rest of this section f will denote this operator. 

To derive this operator we must apply the operator (140.1) to the function 
(125.17) which corresponds to a plane wave incident along the z-axis. Thus 


f= 5 (21+ 1)(ay-+ bl . 8)P;(cos 6). (140.3) 
l=0 


Here we must also calculate the result of the action of the operator 1.8 on 
the function P;(cos 6). This can be done by writing 


i.s=2h64+18)+h6 


(see (29.11)) and using formulae (27.12) for the matrix elements of the oper- 


ators f, or still more simply by using the operator expressions (26.14), 
(26.15). The result is 


i. 8P,(cos @) = iv . 8P; (cos 6), 


where Pi! is the associated Legendre polynomial and v a unit vector in the 
direction nxn’ which is perpendicular to the plane of scattering (n being 
the direction of incidence (the z-axis) and n’ the direction of scattering, 
defined by the spherical polar angles @, ¢). 
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On determining a; and b; from (140.2) and substituting in (140.3), we 
have finally 


f= A+2Bv.§8, (140.4) 


As ee 3 [0 1)(e2#8e* — 1) +.1(e2" — 1) ]P,(cos 6) 
2ik A 
(140.5) 


B Ls 2i8:* _ e218) P (cos 6 
~ a2 — e248") P;l(cos 6). 


The matrix elements of this operator give the scattering amplitude for 
definite values of the spin component in the initial (o) and final (o’) states. 
Let us consider the cross-section summed over all possible values of o’ and 
averaged with respect to the probabilities of various values ø in the initial 
state (in the incident beam). The cross-section is given by 


do = (f*f oo do; (140.6) 


by taking the diagonal matrix elements of the product f +f we effect the 
summation over final states, and the bar denotes the averaging with respect 
to initial states.t If all spin directions are equally probable in the initial 
state, this averaging reduces to taking the trace of the matrix, divided by the 
number of possible values of the spin component o: 


do = 4 tr ( f+f) do. (140.7) 


On substitution of (140.4) in (140.6) the mean value of the square (v.s)? 
is calculated as 4v2s? = łs(s+1) = }. The result is 


dojdo = |4|2+|B|2+2 re(4B*)v . P, (140.8) 


where P = 25 is the initial polarization of the beam, defined as the ratio 
of the mean spin in the initial state to its maximum possible value (4). 
In the case of spin 4 the vector § completely describes the spin state (§59). 

It may be pointed out that the polarization of the incident beam leads 
to an azimuthal asymmetry of the scattering: owing to the factor v.P in 
the last term, the cross-section (140.8) depends not only on the polar angle @ 


t If the squared modulus | fon|? of the matrix element of some operator for the transition 
0 +n is summed over final states n, we have 


X | fon?” = © fon( fon)” =- È fon( f*)no 


= (ff *)oo. 


To avoid misunderstanding, it should be emphasized that the sign * denoting the con- 


jugate refers in (140.6) and henceforward to $ as a spin operator; in particular, the transposition 
of n and n’ is not implied. 
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but also on the azimuth ¢ of the vector n’ relative to n (if the polarization 
is not perpendicular to v, so that v.P # 0). 

The polarization of the scattered particles can be calculated from the 
formula 


P’ = 2 f tf ool f*F Joo (140.9) 


For example, if the initial state is unpolarized (P = Q), a simple calculation 
gives 
2 re(AB*) 


ei 140.10 
A+ |B ew 


Thus scattering leads, in general, to the appearance of a polarization per- 
pendicular to the plane of scattering. This effect is, however, absent in the 
Born approximation: if all the phases ô are small, the coefficient A is real 
in the first approximation with respect to the phases, and B is purely imaginary, 
so that re(AB*) = 0. 

The fact that the polarization P’ (140.10) is in the direction of v is obvious 
a priori. P’ is an axial vector, and v is the only axial vector which can be 
constructed from the available polar vectors n and n’. It is therefore evident 
that this property will also be possessed by the polarization resulting from the 
scattering of an unpolarized beam of particles with spin $ by an unpolarized 
target composed of nuclei with any spin (not necessarily zero). 

In formulating the reciprocity theorem for scattering in the presence 
of spins it must be borne in mind that time reversal changes the signs not 
only of the momenta but also of the angular momenta. Hence the symmetry 
of scattering with respect to time reversal must in this case be expressed by 
the equality of amplitudes for processes which differ not only in the inter- 
change of the initial and final states and the reversal of the directions of 
motion but also in that the signs of the spin components of the particles are 
changed in both states. Here, however, the signs of these amplitudes may 
differ because, according to (60.3), time reversal introduces a factor ( — 1)§~¢ 
in the spin wave function. This has the result that the reciprocity theorem 
must be formulated as follows:{ 


flor o2, n; 61’, 02’, n’) = (—1)E6-)f(— o, — 02’, —0'; — 01, — oe, —n).( 140.11) 


Here f(c, c2, n; 01’, o2, n’) is the amplitude of scattering with change 
in the spin components of the colliding particles from c1, oz to ay, Og. 
The sum in the exponent is taken over both particles before and after scatter- 
ing. 


+ Here we have in mind a target with a completely random distribution of spin directions. 
For s> 4, it will be recalled, the mean value of the spin vector does not fully determine the 
spin state, and if this mean value is zero there 1s not necessarily a complete absence of ordering 
of the spins. 

1 The derivation of this relation is similar to that of formula (125.12). The amplitudes of 
the ingoing and outgoing waves must contain spin factors, and instead of (125.10) we have the 
condition -SÊ = $, where É is an operator which not only effects inversion but also 
changes the spin state in accordance with (60.3). 
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In the Born approximation, the scattering has a further symmetry; the 
probabilities of processes differing by the interchange of the initial and 
final states, without change in the signs of the momenta and spin components 
of the particles as in time reversal, are the same (see §126). Combining this 
property with the reciprocity theorem, we find that the scattering is sym- 
metrical with respect to a change in sign of all the momenta and spin com- 
ponents, without interchange. Hence we easily conclude that in the Born 
approximation there can be no polarization ın the scattering of any unpolar- 
ized beam by an unpolarized target. For, under the transformation men- 
tioned, the polarization vector P changes sign, while the unit vector k x k’, 
whose direction must be the same as that of P, remains unaltered. Thus the 
property noted above for the scattering of particles with spin } by particles 
with spin zero is actually a general one. 

In the case of arbitrary spins of the colliding particles, the general formulae 
for the angular distributions are very complicated, and we shall not pause to 
derive them here, but merely calculate the number of parameters by which 
these distributions must be determined. 

The case considered above of a collision between particles of spin $ and 
0 has, in particular, the property that to given values of j and the parity there 
corresponds only one state of the system of two particles (apart from the 
unimportant orientation of the total angular momentum in space). Each 
such state leads to one real parameter (the phase ô) in the scattering ampli- 
tude. For other spins there are in general several different states with the 
same total angular momentum J anu parity; these states differ in the 
values of the total spin S of the particles and the orbital angular momentum 
l of their relative motion. Let the number of such states be n. It 1s easy to 
see that each such group of states contributes 4m(m+1) real parameters in 
the scattering amplitude. For the S-matrix is, with respect to these states, 
a matrix having unitary symmetry (owing to the reciprocity theorem), with 
n.n complex elements. The number of independent quantities in this matrix 
is conveniently calculated by noting that, if the operator S is written in the 
form $ = exp(iR), the unitarity condition is automatically satisfied when Å is 
any Hermitian operator (see (12.13)). If the matrix $ is symmetrical, so is the 
matrix R, which, being Hermitian, is therefore real, and a real symmetrical 
matrix has $n(m+ 1) independent components. 


As an example, for two particles with spins 4 the number n = 2: for 
given J there are in all four states, two with / = J and total spin S = 0 or 
1, and two with Z = J+1, S = 1. It is evident that two of these states are 
even (l is even) and two are odd (I is odd). 

The general form of the scattering amplitude for particles with spin 3, 
as an operator relating to the spin variables of the two particles, is easily 
written down from the necessary invariance conditions: it must be a scalar 
invariant under time reversal. To construct this expression we have the 
two axial vectors sı and sz of the particle spins and two ordinary (polar) 
vectors n and n’. Each of the operators 8) and & must appear linearly in the 
amplitude, since any function of an operator of spin 4 can be reduced to a 
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linear function. The most general form of operator satisfying these condi- 
tions can be written as 


f=A + B(ŝ1 . A)(S2. A)+ C(ŝ1 . p)(Se - u) + 
+ D(81 .v)(S2 y) + E(81 + $2) . v+ F(ŝ1 — £2) . v. (140.12) 


The coefficients A, B, ... are scalar quantities, which can depend only on the 
scalar n . n’, i.e. on the scattering angle @ (and on the energy); A, p, v are 
three mutually perpendicular unit vectors along n+n’, n—n’ and nx n’ 
respectively. The operations of time reversal correspond to the changes 


$1 > —S], So > —So, n->-—n’, n’ > —n, 
so that 


A> — À, u >p, v -> —Y 


and the invariance of the operator (140.12) is obvious. 

In the mutual scattering of nucleons (protons and neutrons) the last term 
in (140.12) does not appear. This is evident from the fact that the nuclear 
forces acting between nucleons conserve the absolute magnitude of the total 
spin S of the system; the operator $1 — $z, however, does not commute with 
the operator S2. (The remaining terms in (140.12) are expressed, according 
to (117.4), in terms of the total spin operator S, and therefore commute with 
§2.) In the scattering of like nucleons (pp or nn), the coefficients A, B, ... 
as functions of the angle of scattering also satisfy certain symmetry relations 
as a result of the identity of the two particles (see Problem 2). 


PROBLEMS 


PROBLEM 1. Determine the polarization after the scattering of particles with spin 4 by 
particles with spin zero when the polarization before scattering is non-zero. 


SoLuTION. A calculation using formula (140.9) is conveniently effected in components, 
with the z-axis in the direction of v. The result is 


— ((AP~|B)P+2|B)v(v.P)+2 im(AB*)v x P+ 2v re(AB*) 
= a e e a e E y 
|A|2+]B|2+2 re(4B*)v . P 


PROBLEM 2. Find the symmetry conditions satisfied by the coefficients in the scattering 
amplitude for two like nucleons, as functions ot the angle 8 (R. Oehme 1955). 


SoLUTION. We regroup the terms in (140.12) in such a way the: each is non-zero only 
for singlet (S = 0) or triplet (S = 1) states of the system of two nucleons: 


f= ai 82-4) bn -82+ D+ elt + G1 -VEz V) 
4-d{(@y - n)(&z -0') + (ê1 - n')(82 - n)]+e(1 +82) -v- (1) 
Using formulae (117.4), we easily see that the first term is non-zero only for S =O and the 
remainder only for S = 1. Owing to the identity of the particles, the scattering amplitude 


must be symmetric with respect to interchange of the particle coordinates for S = 0, and 
antisymmetric for S = 1. This transformation is equivalent to @ > 7—9, or to a change in 
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sign of one of the vectors n and n’ (cf. §137). From these conditions we obtain the relations 
a(r —6) = a(f), b(—6) = —O(6), (7-8) = —c(8), 

d(x—6) = d(8), e(7—6) = e(8). 


(2) 


Owing to isotopic invariance, the scattering amplitude is the same for nn and pp scattering 
and for np scattering in the isotopic state with T = 1. For the mp system, however, the state 
with T = 0 is also possible, and the np scattering amplitude is therefore described by other 
coefficients a, b,... in (1), which do not possess the symmetry properties (2). 


§141. Regge poles 


In §128 we have considered the analytical properties of the scattering 
amplitude as a function of the complex variable E, the energy of the particle; 
the orbital angular momentum / acted as a parameter having real integral 
values. Further properties of the scattering amplitude that are of metho- 
dological importance appear if we now regard / as a continuous complex 
variable for real values of the energy E.+ 

As in §128, we shall take radial wave functions whose asymptotic form 
(as r co) is 


i. 
xı = rR; = Al, E) exp( - Ar) 
l 


+ BUI, E) exp( Mi), (141.1) 


These functions are solutions of Schrédinger’s equation (32.8) (ın which / is 
now regarded as a complex parameter); the choice from the two independent 
solutions is governed by the condition 


R,; = constant xr? = for r>0. (141.2) 


It is immediately evident that this condition places a certain limitation on 
the permissible values of the parameter /: the general form of the solution of 
equation (32.8) for small r is 


Rı X cyr! + cor! -1 


(see the end of §32). In order for the second solution to be clearly distin- 
guished from the first solution and eliminated, the term in 7-}-1 must exceed 


that inv! as r +0. For complex J, this leads to the condition re l > re (—l-1), 
or 


re (1+4)>0. (141.3) 


In the following, we shall consider only this half of the complex /-plane, to the 
right of the vertical line/ = —}. 





t These properties were first investigated by T. Regge (1958). 
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The wave function R(r; /, E), being a solution of a differential equation with 
coefficients analytic in the parameter /, is an analytic function of /, having no 
singularities in the half-plane (141.3). This applies, in particular, to the 
asymptotic expression (141.1), and the functions 4(/, E) and B(/, E) therefore 
have no singularities with respect to /. Here, however, it is assumed that the 
retention of both terms in (141.1) as r > oc is in fact legitimate. When E>0 
this is always true; when E <0 it is true if the field U(r) satisfies the condition 
(128.6) or (128.13). In these arguments it is important that the form of the 
asymptotic behaviour (with respect to r) of the wave function depends on E 
but not on/. The approach to the asymptotic form is therefore unaffected 
by the fact that / is complex. 

Comparing (141.1) with the asymptotic formula (128.15), we find the 
S-matrix element in the form 


S(l, E) = exp[2i6(/, Ey] = e™! A(l, E): BU, E), (141.4) 


which is valid for complex / also (although the “phase shift’’ 6 is of course 
then not real). 

For real /, and E > 0, the functions 4 and Bare related by (128.4): A(/, E) = 
B*(1, E). Hence it follows that, for complex /, 


AUF SE} = BSL, Pym for E > 8; (141.5) 
and S(l, E) therefore satisfies the complex unitarity condition 
SAG ES", 2). = 1. (141.6) 


Since 4(l, E) and B(l, E) have no singularities as functions of J, the function 
S(i, E) and thus the partial scattering amplitude f (/, E) have singularities 
(poles) only at the zeros of the function B(/, E}. The poles of the scattering 
amplitude in the complex /-plane are called Regge poles. Their position 
depends, of course, on the value of the real parameter E. The functions 


b= Oe), 


which determine the positions of the poles, are called Regge trajectories; when 
E varies, the poles move along certain lines in the /-plane. The subscript 1 
which labels the poles will be omitted henceforward. 

Going on now to study the properties of the Regge trajectories, we shall 
show first of all that for E <0 all the (E) are real functions. To do so, let us 
consider the equation 


PEN EZ a(at | 
oaee u(y) - Sy m (141.7) 


which is satisfied b the wave function with! = a. Multiplying this equation 
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by x* and integrating with respect to r (with integration by parts in the first 
term), we obtain 





0 20 wo 
2) Z 
~ | be drt | (E- Ux? arafat 1) | BE ar = 
~ rY 
0 0 0 


Here we have used the fact that for B = 0 (the condition determining the 
Regge poles) the wave function decreases exponentially as z >æ, so that all 
the integrals converge. The first two terms in the above equation are real, 
and so is the integral in the last term. Hence we must have 


im a(at+i) = im (a+!) = 2re(a+!)ima = 0. 


But, since we are considering only poles in the half-plane (141.3), we certainly 
have re (a+ 4)> 0, and this gives the desired result 


ima(E)=0 for E<0. (141.8) 


Next, we proceed as follows with (141.7), in a similar manner to the 
derivation of equation (128.10): differentiate with respect to E, multiply by y, 
and multiply (141.7) by ¢y/cE, and subtract. This gives the identity 


, ex CONN iin, x? da(a +1) 
=o ae a ERE: = “+ a AEE s SER “= 0. 
|x > x( )] pe ae ae 





Integration with respect to r from 0 to æ, again using the fact that y >0 as 
r >œ, shows that the integral of the first term is zero, and we have 


dr. (141.9) 


Since we know that x is real, the wave function is also real, and both integrals 
in (141.9) are therefore positive. Hence 


d d 
gg t 1) = 2(a+4) —>0, 


and, since x+]>0, 
daid& >0 for F< 0. 


Thus, for E <0, the functions a(£) increase monotonically with £. 
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The negative values of E for which the functions o(£) take “physical” 
values (i.e. are integers / = 0, 1, 2, ...) correspond to the discrete energy 
levels of the system. Note that this gives rise to a new principle of classification 
of bound states, according to the Regge trajectories on which they lie. 

As an example, let us consider Regge trajectories for motion 1n an attractive 
Coulomb field. The scattering matrix elements are then given byT 


_ Pd+1—-i7k) 


Tis Tae) 


(141.10) 


with kin Coulomb units. The poles of this expression are at points where the 
argument of ['(/+1-—7/k) is a negative integer or zero. For E< Q we have 


k = iy/( — 2E), so that 


A= sA E BKO: (141.11) 


where 7a; = 0, 1, 2, ... is the number of the Regge trajectory. Equating o(£) 
to an integer / = 0, 1, 2, ..., we obtain the familar Bohr formula for discrete 
energy levels in a Coulomb field: 


E = —}(n,-+14+1)~*. 


The number nr here coincides with the radial quantum number which 
determines the number of nodes of the radial wave function. Each Regge 
trajectory (i.e. each given value of nr) corresponds to an infinity of levels with 
different values of the orbital angular momentum. 

Let us now consider the properties of the functions a(Z) for E >0. The 
functions A(J, E) and B(l, E) of the complex variable E in (141.1) are defined 
on a plane with a cut along the right half of the real axis (see §128). Corres- 
pondingly, the functions / = a(£), for which B(/, E) = 0, have a similar cut. 
On the upper and lower edges of the cut, «(£) has complex conjugate values, 
with im a >0 on the upper edge. Without pausing to give a formal proof of 
this, we shall present a more physical explanation of the reason. 

When / is complex, so is the centrifugal energy and therefore the effective 
potential energy U; = U+U(1+1)/2mr’?. Repeating the derivation in §19, 
we now have instead of (19.6) 


Liye div j = 2/¥]? im Ur 


When! = a, andima>0, we also have im U7 > 0. Then the right-hand side 
of the equation is positive, signifying an emission of new particles in the field 
volume. Accordingly, the asymptotic expression for the wave function 
(which, when B = 0, contains only the first term in (141.1)) must represent 





+ Cf. (135,11), in which the sign of k must be changed to convert from repulsion to 
attraction. 
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an outgoing wave, and this occurs on the upper edge of the cut; cf. the 
derivation of (128.3) from (128.1). 

Since, for E >0, the functions a(£) are complex, they cannot here take 
their “physical” values / = 0, 1, 2, .... They may, however, be close to these 
values in the complex /-plane. We shall show that there is then a resonance in 
the partial scattering amplitude (corresponding to the integral value of / in 
question). . 

Let lo be the integral value close to the function a(£), and let Eg be the (real 
and positive) value of the energy for which re a(£o) = lo. Then, near this 
value, we have 


o(E) = Ip +in + B(E— Eo), (141.12) 


where 7 = im a(Eo) is a real constant. We shall consider values (E) on the 
upper edge of the cut; according to the preceding discussion, 7 >Q in that 
case (andy <1, from the assumption that g is close to lo). It is easy to see that 
the constant f (i.e. the derivative da/dE for E = Eo) may be regarded as real 
and positive: since a(£) is almost real, so is the wave function y(r; a, E). 
Neglecting quantities of a higher order of smallness with respect to y, we can 
neglect the imaginary part of x, and then it follows that £ is positive, since 
the integrals in (141.9) are positive.f 

Since / = a(F)is a zero of B(1, E), the latter ts proportional to æ — I near the 
point a, Eo. Using (141.12), we therefore have 


B(lo, E) x constant x [a( E — Eo) + in]. (141.13) 


The form of this expression is the same as that of (134.6), with Eo the energy 
and l = 2n/a > 0 the width of the quasi-discrete level. Thus the closeness of 
the Regge trajectory (for E >0) to integral values of / corresponds to quasi- 
stationary states of the system. For these states, therefore, there exists the 
same principle of classification as for strictly stationary states: each Regge 
trajectory can correspond to a family of discrete and quasi-discrete levels. 

The treatment of / as a complex variable enables us to derive a useful 


integral form of the total scattering amplitude (for E > 0), given by the series 
(123.11): 


t To elucidate the structure of these integrals, we note that the asymptotic region r > a 
(where a is the range of action of the field), in which the expression (141.1) for the wave 
function is valid, makes only a small contribution to the integrals if ņ is small. For, if l = XE) 
is a zero of B(/,E), then by (141.5) Z = 2* is a zero of A(/,E). Hence A(x,E) and therefore 
x(7;x,E) in the region r > a, are small quantities ~ y/*; see (134.11). In estimating the 
integrals, it is also important that, on the upper edge of the cut (in relation to E), the wave 
function contains a factor e7: x(r:x,E) = A(a,E)e'*r. On this edge, we can regard E as 
E+15 (ô— +0); then k also has a small positive imaginary part, which ensures the con- 
vergence of the integrals in (141.9). Physically, the smallness of the contribution to the 
integrals from the region r > a is due to the fact that the energy E, corresponds to a quasi- 
stationary state (see below); the particle therefore reaches this region only as a result of an 
improbable decay of the state. The principal contribution to the integrals comes from the 
region r~a, in which the wave function ts almost real. 


t 
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oe a D+ ISU, E)—1]Pi(u), p= cos 6. (141.44) 


To obtain this, we must first define the functions P:(p) not only for integral 
1>0 but also for complex /. This can be done by taking P:(p) as the solution 
of equation (c.2): 

(1 — p?)Pr''(u) — 2p Pr'(u) + + 1) Pile) = 0 (141.15) 


with the boundary condition P;(1) = 1. The P:(u) thus defined as a function 
of l has no singularities for finite values of l.t 

It is easily seen that the series (141.14) is equal to the integral 
21+ 1 


sin al 





s= bse. E)—1]P(—p)dl, (141.16) 


taken along a contour C that passes in a negative direction (clockwise) round 
all the points/ = 0, 1, 2, ... on the real axis and is closed at infinity: 





All the poles / = a, a, ... of the function S(/, E} (which are not on the real 
axis if E >0) must remain outside the contour C. The integral (141.16) 
reduces to — 277 times the sum of the residues of the integrand at the points 
l = 0,1, 2, ..., which are poles of the function 1/sin 7l, and the residues of 
this function itself are (—1)!'/7. Since for integral Z we have P)(—p) = 
(—1)'Pi(u), we arrive at (141.14) from (141.16). 


PROBLEM 
Show that the phase shifts corresponding to successive integral values of / satisfy the inequality 
6; , (E) — ô (E) < 37. 


SoLution. We will regard / as a continuous real variable, and differentiate (32.10) with respect 


to it: 
ox" 2m I(l +1) | ex 
Sica ET | ST DB me 2, 
al +| ae E CARERE 


Multiplication of this by x and the original equation by ĝx/ĉl, and subtraction, gives 
[xôx']ðl — x'ðxiðly = (2 + 1) x? }r?. 
We integrate this over r from 0 to œ. For r = 0 the expression in the square brackets is zero, and 


for 7 = œ we can use for y the asymptotic expression (33.20). The result ts 
x 


4k (in — 06,/01) = (21 + 1) fær dr > 0, 


D 
so that @6,/01 <in. Integration over / from l to / + | gives the required inequality. Together with 
(133.17), it shows that the number n; of discrete levels does not increase with /. As E — œ and the 
Born approximation becomes valid, the scattering phases tend to zero, so that 6,(00) = 0. Then 


mı — % = (1/7) [ô ,(0) — ô (0)] Sa nı — SO. 





+ By comparison of (141.15) with (e.2) we can express Pi(u) as a hypergeometric function: 
Pi(p) = F(—1,14+1, 1; 4-4p). 
t A more detailed account of the ideas discussed in this section (for non-relativistic theory) 
is given in the book by de Alfaro and Regge quoted in §123. 


CHAPTER XVIII 


INELASTIC COLLISIONS 


§142. Elastic scattering in the presence of inelastic processes 


COLLISIONS are said to be inelastic when they are accompanied by a change 
in the internal state of the colliding particles. Here we understand “ʻa change 
in the internal state” in the widest sense; in particular, the very nature of 
the particles may be altered. For example, the change may consist in the 
excitation or ionization of atoms, the excitation or disintegration of nuclei, 
and so on. Where a collision (e.g. a nuclear reaction) may be accompanied 
by various physical processes, these are referred to as various channels of the 
reaction. 

The existence of inelastic channels has a certain effect on the properties 
of elastic scattering also. 

In the general case where various reaction channels exist, the asymptotic 
expression for the wave function of the system of colliding particles is a sum, 
with one term corresponding to each possible channel. Among these there 1s, 
in particular, a term describing the particles in the original unchanged state 
(the input channel). This is the product of the wave functions of the internal 
state of the particles and a function describing their relative motion (in a 
coordinate system where their centre of mass is at rest). The latter function 
is the one of interest here; we shall denote it by ¥, and seek its asymptotic 
form. 

The wave function Ņ in the input channel consists of an incident plane 
wave and an outgoing spherical wave corresponding to elastic scattering. It 
can also be represented as the sum of an ingoing and an outgoing wave, as 
in §123. The difference is that the asymptotic expression for the radial 
functions R(r) cannot be taken in the form of the stationary wave. The 
Stationary wave is the sum of ingoing and outgoing waves of equal amplitude. 
In purely elastic scattering this corresponds to the physical significance of 
the problem, but when there are inelastic channels the amplitude of the out- 
going wave must be less than that of the ingoing wave. The asymptotic 
expression for y will therefore be given by formula (123.9): 


1 oD 
b= 5 > (21+ 1)Pi(cos 6){(—1)!+1e-tkr + Syetkr}, (142.1) 
1=0 


except that the S; are no longer given by (123.10), but are certain quantities, 
in general complex, with moduli less than unity. The elastic scattering 
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amplitude is given in terms of these quantities by formula (123.11): 
[s a) 
6) = ! 21+ 1)(S;—1)P a2 
©) = mm 2 + 1)(Si— 1)Pi(cos 6). (142.2) 


For the total elastic scattering cross-section oe we have, instead of (123.12), 
the formula 


= ie 2) 
ee 2e) A (142.3) 
1=0 


The total inelastic scattering cross-section or reaction cross-section cy for all 
possible channels can also be expressed in terms of the Sı. To do so, we 
need only note that for each value of / the intensity of the outgoing wave 1S 
reduced in the ratio |Sj|2 in comparison with that of the ingoing wave. This 
reduction must be ascribed entirely to inelastic scattering. It is therefore 
clear that 


qT = 9 
or = z2 (2 +1) — |S”), (142.4) 
1=0 
and the total cross-section 1s 


ie 8) 


> (21+ 1)(1 = re Si). (142.5) 


=0 


2a 
k2 





öt = Ce + Cr = 


The partial amplitude for elastic scattering with angular momentum A 
determined from (123.15), is 


fi = (Si— 1/21, (142.6) 
and each of the terms in the sum in (142.3) and (142.4) is the partial cross- 


section for elastic or inelastic scattering of particles with angular momen- 
tum I: 


oi = (nfh2)(21-+ 1)|1 — Sif, 
o = (/k2)(214+-1)(1 — | Sil’), (142.7) 
of) = (2n/k2)(21+ 1)(1— re Si). 
The value S; = 1 corresponds to the complete absence of scattering (with 
a given 1). The case S; = 0 corresponds to total “absorption” of particles 


with angular momentum / (there is no outgoing partial wave with this / in 
(142.1)); the cross-sections for elastic and inelastic scattering are then equal: 


of!) = ol!) = (a/k2)(21+4 1), (142.8) 
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Although elastic scattering can occur without inelastic scattering (when 
|S:| = 1), the opposite situation is impossible: the presence of inelastic 
scattering necessarily implies the simultaneous presence of elastic scattering. 
For a given value of or”, the elastic scattering cross-section must be in the 
range 


Vo— vy (0-69) < Vo!) < 00+ vleo- o), (142.9) 


where oo = (21+ 1yr{k?. 


Taking the value of f (6) from (142.2) for 6 = 0 and comparing with (142.5), 
we find 


im f(0) = katj4r, (142.10) 


which is a generalization of the optical theorem (125.9). Here f (0) is again 
the amplitude of elastic scattering through zero angle, but the total cross- 
section oz includes the inelastic component. 

The imaginary parts of the partial amplitudes fz are related to the partial 
cross-section o by 
k ae 


im f = — ; 
fi 4n 2141 





(142.11) 


which follows directly from (142.6) and (142.7). 

The fact that the coefficients S; in the asymptotic expression for the wave 
function are not of unit modulus does not affect the conclusions of §128 
concerning the singular points of the elastic scattering amplitude as a function 
of complex E. These conclusions remain valid when inelastic processes 
occur. The analytical properties of the amplitude are, however, changed in 
that it is no longer real on the negative real axis (E < 0), and its values on the 
upper and lower sides of the cut for E > 0 are not complexconjugate quantities 
(and accordingly its values at all points in the upper and lower half-planes 
symmetrical about the real axis are not complex conjugate quantities). 

When we go from the upper edge of the cut to the lower edge by passing 
round the point E = 0, the quantity 4/E changes sign, i.e. this process 
changes the sign of the quantity k, which is real (for E > 0). The ingoing 
and outgoing waves in (142.1) are interchanged, and so the coefficient S; is 
replaced by its reciprocal 1/S; (which is not equal to S;"). The amplitudes 
fi on the upper and lower edges of the cut may be denoted by fi(k) and f;( — k) 
(only f(k) is a physical amplitude, of course). According to (142.6) we have 





CPER WSE 
De se f(Sh)= — . 
ia 21k peat) 2k 
Eliminating S; from these two equations gives 
Fuk) — fil — R) = 2tkfi(k)fi( —k); (142.12) 


in the absence of inelastic processes, f ( — k) = f*(k), and the relations (142.11) 
and (142.12) are the same. 
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Writing (142.12) in the form 
1 1 
fdk) f-k) 


we see that the sum 1/f;(k)+7k must be an even function of k, and ìf this is 
denoted by g:(k?), then 





= — 21k, 


f(k) = (142.13) 


p(k?) — ik 
The even function g;(k2), however, is not now real as it was in (125.15).t 

When a beam of particles passes through a scattering medium consisting 
of a large number of scatterers, it is gradually attenuated owing to the re- 
moval of particles from it which undergo various collision processes. This 
attenuation is entirely determined by the amplitude of elastic scattering 
through zero angle and, under certain conditions (see below), can be described 
by the following formal method. 

Let f(0,E) be the amplitude of scattering through angle zero by each 
individual particle of the medium. We shall suppose that f is small in com- 
parison with the mean distance d ~ (V/N)' between the particles. Then the 
scattering by each particle may be considered separately. We use as an 
auxiliary quantity an ‘‘effective field” Uert of a fixed centre, so defined that 
the Born scattering amplitude for scattering through angle zero in this field 
is equal to the actual amplitude f(0,£); this does not mean, ofcourse, that 
the Born approximation can be used to calculate f(0,£) from the actual 
interaction of the particles. Then, by definition, we have (see (126.4)) 


27h? 
| Gaut = — —f(0,E), (142.14) 
m 


where m is the mass of the scattered particle. The field thus defined is, like 
the amplitude f, complex. The relation between its range of action a and 


the quantity Uert is obtained from an estimate of the two sides of equation 
(142.14): 


a@ Uert ~ he fim. (142.15) 
The definition (142.14) is, of course, not unique. We shall impose the 


further condition that the field Uert satisfies the condition for perturbation 
theory to be applicable: 


| Uen| € h?/ma’, (142.16) 





+ The foregoing arguments, and the conclusion that the function gi 1s even, assume that the 
interaction decreases sufficiently rapidly as 7 -> X, so that there are no cuts in the left half-plane 
of E and a complete circuit round the point E = 0 is possible. 

t The following treatment can be used, in particular, for the description of scattering of fast 
neutrons (with energies of the order of hundreds of MeV) by nuclei, the wavelengths of such 
neutrons being so small that the nucleus may be regarded as an inhomogeneous macroscopic 
medium. 
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with | f| < a. It is easy to see that the attenuation of the scattered beam can 
then be described as the propagation of a plane wave in a homogeneous 
medium in which the particle has a constant potential energy given by 


eee UN 
Ua =- Í Usnd V 
2 
= ————f(0,F), (142.17) 
V m 


which is obtained by averaging the effective fields of all N particles in the 
medium over its volume V. This becomes evident if we first consider 
scattering by a region of the medium which contains many scattering centres 
but for which the scattering effect is still small; the possibility of selecting 
such regions 1s ensured by the condition (142.16). The attenuation of the 
beam on passing through such a region 1s determined by the amplitude of 
scattering through angle zero, which in turn is determined, in the Born 
approximation, by the integral of the scattering field over the volume of the 
scattering region. This means that. the scattering properties of interest here 
are entirely determined by the field (142.17) averaged over the volume of the 
medium. 

Thus the beam of particles passing through the medium can be described 
by a plane wave ~ etkz with wave number 


1 le 
k= F a/{2m(E— Uen)]. 


In terms of the wave number ko = 4/(2mE)/h of the incident particles, we 
can write k in the form nko, where the quantity 





Uer 
n=v(l-—) 
E 142.18 
=4/( ee a), ( 18) 


plays the part of a “refractive index” of the medium with respect to the 
beam of particles passing through it. It is in general complex (the amplitude 
f being complex) and its imaginary part gives the attenuation of the beam 
intensity. If E> |U., then (142.18) gives, as it should, 





N wh? 


im 72 = — 
V mE 





im f(0,E) 
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where g; is the total scattering cross-section, and we have used the optical 
theorem (142.10). This expression corresponds to the obvious result that the 
intensity of the wave is damped according to the law 


ikz|2 . ep-Noiz/V 
|eikz|2 ~ e : 


As well as the absorption, the refractive index (142.18) also determines (by 
its real part) the law of refraction of the beam on entering and leaving the 
scattering medium. t 


PROBLEM 


Neutrons are scattered by a heavy nucleus whose radius a is large compared with the wave- 
length of the neutrons (ka ® 1). It is assumed that all neutrons incident with orbital angular 
momentum | < ka = lo (i.e. with impact parameter p = ħljmv = ljk <a) are absorbed 
by the nucleus, while those with / > lo do not interact with it at all. Determine the cross- 
section for elastic scattering through small angles. 


SOLUTION. Under the conditions stated, the motion of the neutrons is mainly quasi- 
classical, and elastic scattering results from a slight deflection entirely analogous to Fraun- 
hofer diffraction of light by a black sphere. The required cross-section can therefore be 
written immediately from the known solution of the diffraction problem :f 


J? (ka) 
doe = 7a2—_— do. 
nb? 


The same result can also be derived from (142.3). According to the conditions of the 
problem, we have Sı = 0 for /</, and Sı = 1 for l>l,. The elastic scattering amplitude is 
therefore 


lo 
1 
E et Pi{cos6). 
KO = 2. (21+ 1)P;(cos8) 


The chief part in the sum is played by the terms with large I. We therefore write 2/ in place 
of 2/+1, use the approximate expression (49.6) for Pi (cos 8) with 8 small, and change from 
summation to integration: 


8 E a él)dal 
=| o(6l) 


; 
= —loJ (0l 
m 1(8lo) 





+ An interesting example of the application of (142.17) is the displacement of the highet 
levels of an alkali-metal atom in a gas. In a highly excited state, the valency electron is at a 
mean distance 7 from the centre of the atom that is large compared with the dimensions @ of 
both the rest of the atom and the neutral gas atoms. The latter atoms within a sphere of 
radius ~F act as scattering centres for the valency electron and shift its energy level by an 
amount (142.17). Since the de Broglie wavelength of the excited valency electron is also large 
in comparison with a, the amplitude f(0, E) = —a, where « is the scattering length; cf. 
(132.9). Thus this effect displaces the levels by a constant amount 2ah*a vim, where m is the 
electron mass and » the number density of the gas particles (E. Fermi 1934). i 

t See Fields, §61, Problem 3 (the problem of diffraction from a black sphere is equivalent 
to that of diffraction from a circular aperture cut in an opaque screen). The cross-section 1s 
obtained by dividing the intensity of the diffracted waves by the incident flux density. 
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= (ia/6)J,(kaé), 


as it should be.t 
The total elastic scattering cross-section is 


foe) 


J?(kaé 
Ce = nat | ae m8 db 


T 





0 


= na’; 


the integration can be extended to infinity because of the rapid convergence. This is the 
result to be expected under the conditions stated (cf. (142.8)), and is the same as the 
absorption cross-section, simply the geometrical cross-section of the sphere. The total cross- 
section of = 27a?. 


§143. Inelastic scattering of slow particles 


The derivation of the limiting law of elastic scattering at low energies given 
in §132 can easily be generalized to the case where inelastic processes are 
involved. 

As before, the scattering with / = 0 is the most important at low energies. 
According to the results of §132, the corresponding element of the S-matrix is 


So = e280 ~ 14 2189 = 1 — tka. 


The properties of the wave function described in §132 are changed only in 
that the condition imposed on it at infinity (the asymototic form (142.1)) is 
now complex, instead of the real stationary wave which occurs in the case 
of purely elastic scattering. The constant a = —c2/c, is therefore complex 
also. The modulus | So | is no longer equal to unity; the condition |So | < 1 
means that the imaginary part of a = «'+i«” must be negative («” < 0). 
Substituting So in (142.7), we find the cross-sections for elastic and in- 
elastic scattering: 
Oe = 4rr|a]?, (143.1) 


or = 4r|a"|jk. (143.2) 


Thus the elastic scattering cross-section is again independent of velocity, but 
the inelastic cross-section is inversely proportional to the particle velocity— 
the 1/v law (H. A. Bethe 1935). Consequently, as the velocity diminishes, 


inelastic processes become more and more important in comparison with 
elastic scattering. 


t A similar discussion can be given for the problem of diffraction scattering of fast charged 
particles by a “black” nucleus. The limiting value Z, must here be determined from the con- 
dition that the shortest distance between the nucleus and a particle moving along a classical 
path in a Coulomb field is just equal to the radius of the nucleus. For / <la we must again 
put Sı = 0, and for /> J, Sı = e7, where & are the Coulomb phases given by (135.11) 
See A. I. Akhiezer and I. Ya. Pomeranchuk, Some Problems of Nuclear Theory (Nekotorye 
voprosy teorii yadra), Gostekhizdat, Moscow 1950, §22; Journal of Physics 9, 471, 1945. 

t The velocity dependence ot the partial reaction cross-sections for various non-zero orbital 
angular momenta / can be determined similarly. It is given by or! ~ k%-1. The elastic 
scattering cross-sections Ge‘) are, as before, proportional to k4!, i.e. they decrease more rapidly 
than or“) with the same / as k > 0. ° 
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The limiting laws (143.1) and (143.2) are, of course, only the first terms 
of expansions of the cross-sections in powers of k. It is interesting to note that 
the next term in the expansion for each cross-section contains no constants 
other than those which appear in (143.1) and (143.2) (F. L. Shapiro 1958). 
This result is due to the fact that the function go(k2), in the expression (142.13) 


folk) a. oa 


for the partial scattering amplitude (J = 0), is even. For small k this function 
can therefore be expanded in even powers of k, and the term following 
go = —l/ais ~k?. If we neglect this term, we can still write two terms of 
the expansion in fo(k): 

fo(Rk) = — a(1 — tka). 


Correspondingly we can retain the next terms of the expansions in the cross- 
sections, for which the following expressions are easily obtained: 


Ge = 42] a|2(1 — 2kļ|æ"|), (143.3) 
Or = 4r|a”|(1 — 2kja”|)/k. (143.4) 


These results assume a sufficiently rapid decrease of the interaction at 
large distances. We have seen in §132 that the elastic scattering amplitude 
tends to a constant limit as k > 0 if the field U(r) decreases more rapidly 
than 7-8. This is a necessary condition also for the validity of the analogous 
result (143.1) when inelastic channels are present. 

The 1/v law for the reaction cross-section is subject to a weaker condition: 
the field must decrease more rapidly than r~2, as is clear from the following 
intuitive derivation of the 1/v law. 

The probability that a reaction will occur in a collision is proportional to 
the squared modulus of the wave function of the incident particle in the 
“reaction zone” (in the region r ~ a). Physically, this statement expresses 
the fact that, for example, a slow neutron colliding with a nucleus can bring 
about a reaction only if it “penetrates”? into the nucleus. If the interaction 
decreases more rapidly than 772, it does not change the order of magnitude 
of the wave function between large r and r ~ a; in other words, the ratio 
| ¥(a)/( œ) |2 tends to a finite limit as k > 0 (this is seen from the fact that 
the term Uy in Schrédinger’s equation is small compared with Ay). The 
reaction cross-section is obtained by dividing |4|? by the current density. 
If w is taken as a plane wave normalized to unit current density, we have 
|Y|2~1/v, the required result. 

In collisions of charged nuclear particles, there is a slowly decreasing 
Coulomb field in addition to the short-range nuclear forces. The Coulomb 
field may considerably alter the magnitude of the incident wave in the 
reaction zone. The reaction cross-section is found by multiplying 1/v by 


+ The formula (143.3), which takes into account the next term in the expansion in powers of 
k, requires that U should decrease more rapidly than r~‘*. 
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the ratio of the squared moduli of the Coulomb and free wave functions as 
r—0. This ratio is given by formulae (136.10), (136.11). The result is 
(in Coulomb units) 
27A 
R2lex2n/k_ 1] ’ 


(143.5) 


the plus sign in the exponent corresponds to repulsion and the minus sign to 
attraction. 

The coefficient A is the constant in the ljv law: ıt the velocity is large 
compared with the Coulomb unit (k> 1), the Coulomb interaction plays no 
part and we return to the law a, = A/k. 

If the velocity is small compared with the Coulomb unit (k < 1, or in 
ordinary units 2 Z2e"/hiv ® 1, where Zye, Zoe are the charges of the colliding 
particles), the Coulomb interaction is predominant in determining the magni- 
tude of the wave function in the reaction zone. Then for 2 collision between 
attractive particles 


or = 27A/k?, (143.6) 
and for acollision between repulsive particles 
Or = (27 A/k2)e—2n/k (143.7) 


In the latter case, the cross-section tends to zero as k->0. The exponential 
factor by which (143.6) and (143.7) differ is the probability of passage through 
the Coulomb potential barrier: in ordinary units it is exp( — 2r Z, Zae? /hv). 

Note that the limiting law (143.6) applies not only to the total cross- 
section but also to the partial cross-sections with each angular momentum Lf 
This is seen from the fact that in the expansion (136.1) of the functions y{) 
(which appear in the formulae (136.10) and (136.11) used above) the functions 
Rxi in every term of the sum have the same limiting dependence on k: in the 
limit k +0, the radial functions (for the case of attraction) are given by the 
expressions (36.25), and near the centre we have Ry;~4/kr!. The contri- 
butions of the individual angular momenta to the square of the wave function 
in the reaction zone are ~a2!/h, i.e. all depend on kin the same way, although 
they are reduced by the small factor (ajae), where a, = h?2/m Z, Zoe? is the 
Coulomb unit of length. 


$144. The scattering matrix in the presence of reactions 


The cross-section Or considered in §§142 and 143 was the total Cross- 
section for all possible inelastic scattering channels. We shall now describe 


the derivation of the general theory of inelastic collisions, in which each 
channel can be considered separately. 


We shall suppose that, as a result of the collision of two particles, two 





+ The same is true of (143.7). 
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particles (which may be the same or different ones) are formed. We number 
all possible reaction channels (for a given energy), and denote quantities 
pertaining to them by appropriate sufhxes. 

Let channel z be the input channel. The wave function of the relative 
motion of the colliding particles (in the centre-of-mass system) in this 
channel is given by the sum already mentioned of the incident plane wave 
and the elastically scattered outgoing wave: 
etk;r 


pi = etk + fulb) 





(144.1) 


r 


The square of the amplitude fi; gives the cross-section for elastic scattering 
in channel 7: 


do; = | f ul?do. (144.2) 
In other channels (suffix f ) the wave functions of the relative motion of the 


particles represent outgoing waves. As explained above, these waves are 
conveniently represented in the formt 


br = fri) J 





(144.3) 


ee ? 
mi Y 


where ky is the wave vector of the relative motion of the reaction products in 
channel f, 6 the angle between it and the z-axis, and m, my the reduced 
masses of the two initial and two final particles. The scattered flux in the 
solid angle do is obtained by multiplying the square | yr|? by vsr?do, and the 
cross-section for the corresponding reaction is found by dividing this flux by 
the incident flux density, which is v;. Thus 


doy: = | frl don, (144.4) 
{ 


where the momenta p; = miv, pf = mpor. 

In §125 we have defined the scattering operator S, which converts an in- 
going wave into an outgoing one. When several channels are present, this 
operator has matrix elements for transitions between different channels. 
The elements which are ‘‘diagonal’’ with respect to the channels correspond 
to elastic scattering, and the non-diagonal elements correspond to various 
inelastic processes. All these elements remain operators with respect to the 
other variables. They are determined as follows. f 

Similarly to the method used in §125, we define operators fia, fri related 
to the amplitudes fiz, fri, by 


Sr = Spit 2iv/(kikf)fji- (144.5) 
It is easily seen that with this definition we obtain an S-matrix which must 


+ Here we again denote the initial state of the system by the suffix f and the final state by f (cf. 
the first footnote to §41). In the scattering amplitude, the suffix for the final state is written 
to the left of that for the initial state, in accordance with the placing of the suffixes in the matrix 
elements. For uniformity, the suffixes in the cross-sections will be put in the same order. 
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Satisfy the unitarity condition. For we can write the wave function in the 
input channel as a set of ingoing and outgoing waves, as in §125: 














i i? E å ikyr 
enem = Cg 
ra/ry T/ Vi 
ikyr ikir 
E e Be A a o a acl (144.6) 
rV/V; ravi 


Here, for convenience, we have introduced a further factor 1/4/v; in com- 
parison with (125.3). Then, with the amplitudes as defined above, the wave 
function in channel f is 








= my f F 7 etk sr 
wy Ziki Jri ( D d; 
ik pr 
Gahan. (144.7) 
r/vy 


The flux in the ingoing waves must be equal to the sum of the fluxes in 
the outgoing waves in all channels. This requirement expresses the obvious 
condition that the sum of the probabilities of all processes (elastic and in- 
elastic) which can occur in the collision must be unity. On account of the 
factor 4/v in the denominators of the spherical waves, the velocity does not 
appear in the flux densities in these waves. The above cr:-dition therefore 
means simply that the normalizations of the ingoing wave and the assembly of 
outgoing waves must be the same. It is consequently again expressed by the 
condition of unitarity of the scattering operator, regarded as a matrix, with 
respect to (in particular) the channel numbers. For the operator fy; this 
condition becomes 


fn- = 2i > Raf inf (144.8) 


which is analogous to (125.7). The index + denotes taking the complex 
conjugate and transposing with respect to all the matrix suffixes except the 
channel number, 

The S-matrix is diagonal with respect to states having definite values of 
the orbital angular momentum /; the corresponding matrix elements are 
distinguished by the index (/). By applying the operators fig and fr to the 
function (125.17), we obtain the amp litudes for elastic and inelastic scattering 
processes in the form 


j ke s 
KE prete 1)(Su:® ~1)P,(cos 8), 
(144.9) 


1 
n a rime (2 
Ir ihe a I) S71 Pifcos 8). 
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The corresponding integral cross-sections are 


= © 
ou = > N (2+ 11 - Sie |2, 
Be. 
(144.10) 
T 
ofi = aa), l+ 1) Spel}. 


i l=0 


The former is the same as (142.3). The total reaction cross-section or (from 
input channel 2) is 


Cr = > oji 
f 


taken over all f # i. Since the S-matrix is unitary, we have 
>, Srl? = 1-lSul?, 
f 


which gives formula (142.4) for or. 
The symmetry of the scattering process with respect to time reversal (the 
reciprocity theorem) is given by the equation 


Spi = Sifo (144.11) 
or, what is the same thing, 


fr = for. (144.12) 


The symbols i* and f * denote states which differ from 7 and f by a change 
in the signs of the momenta and spin components of the particles; they are 
said to be time-reversed relative to the states i and f. The relations (144.11) 
and (144.12) generalize formulae (125.11) and (125.12) for elastic scattering. [ 

Equation (144.12) leads to the following relation for the reaction cross- 
sections: 


dopilpr dof = does. /pi2d ris (144.13) 


This expresses the principle of detailed balancing. 


t For complex particles (atoms and nuclei) the “spin” is here to be taken as the total 
intrinsic angular momentum, consisting of the spins and the orbital angular momenta of the 
internal motions of the constituent parts. 

t Here we omit the factor —1 which may appear in collisions of particles having spin (cf. 
(140.11)). This, of course, does not affect formula (144.13) for the cross-sections. 
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It has been mentioned in §126 that, if perturbation theory is applicable, 
then in the first approximation we have not only the reciprocity theorem but 
also a further relation between the amplitudes of the direct and reverse 
processes (in the literal sense), :—> jf and f—>i. This property, expressed 
by the equation ffi = fis*, holds good for inelastic processes (in the same 
approximation). The corresponding cross-sections are then related by 


doy; _ dois 





pyedoy pido; iii 

The difference between the transitions 7 > f and 7* > f* no longer exists 
if we consider the cross-sections integrated over directions of py and summed 
over directions of the spins siy, ser of the resulting particles and averaged over 
the directions of the momentum p; and spins 513, s2; of the initial particles. 
Let this cross-section be 


a 1 
ra eee > do pfdoż; 


the sum ıs taken over the spin components of all particles, and the factor 
before the sum and integral is due to the fact that we average, not sum, over 
quantities pertaining to the initial particles. Writing (144.13) in the form 


Pirdoszidow = prrdojzs.dos 
and effecting the integrations and summations, we obtain 
gpi Tfi = eps Tis. (144.15) 


Here 
gi = (253744 1)(250;4 1), Er (2517+ 1)(2se¢+ 1); (144.16) 


these determine the numbers of possible spin orientations of the initial pair or 
the final pair of the particles, and are called the statzstical weights of the states 
i and f. 

Finally, we may note the following property of the amplitudes fr. We 
have seen in §140 that the cross-section oy; varies as 1/p; when p;—> 0 (if 
the interaction decreases sufficiently rapidly at large distances). According to 
formula (144.4), this means that ff; > constant as pi2O0. Hence it follows 


from the symmetry property (144.12) that ffi tends to a constant limit as 
py-> Oalso. We shall return to this result in §147. 


§145. Breit and Wigner’s formulae 


In §134 we have introduced the concept of quasi-stationary states as being 
those which have a finite but relatively long lifetime. A wide class of such 
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states arises in the field of nuclear reactions at not too high energies which pass 
through the stage of formation of a compound nucleus. 

An intuitive physical picture of the processes occurring is that the particle 
incident on the nucleus interacts with the nucleons in the nucleus and 
‘‘coalesces’’ with them, forming a compound system in which the energy 
contributed by the particle is distributed between many nucleons. The 
resonance energies correspond to the quasi-discrete levels of this compound 
system. The long lifetime of the quasi-stationary states (compared with the 
periods of the motion of the nucleons in the nucleus) is due to the fact that 
for the greater part of the time the energy is distributed between many 
particles, so that none of them has sufficient energy to overcome the attraction 
of the other particles and leave the nucleus. Sufficient energy for this purpose 
is only comparatively rarely concentrated on one particle. The disintegration 
of the compound nucleus can then take place in various ways corresponding 
to the various possible reaction channels. } 

This description of such collisions shows that the possibility of inelastic 
processes does not affect the potential part of the elastic scattering amplitude, 
which is not related to the properties of the compound nucleus (see §134); 
inelastic processes change only the resonance part of the elastic scattering 
amplitude. For the same reason the amplitudes of inelastic scattering pro- 
cesses which pass through the stage of formation of the compound nucleus 
are purely resonance in character. The resonance denominators of all ampli- 
tudes which relate to the vanishing of the coefficient of the ingoing wave for 
E = Eo—3#I retain their form (E— Eo+4:1), I being again the total proba- 
bility of decay of any given quasi-stationary state of the compound nucleus. 

These arguments, together with the unitar:.y condition which must be 
satisfied by the scattering amplitudes, are sufficient to establish the form of 
these amplitudes. 

The calculations may conveniently be made in a symmetrical form by 
numbering all possible channels of disintegration of the compound nucleus 
and not specifying beforehand which of them is the input channel for the 
reaction concerned. The sufhxes denoting the channel numbers will be 
represented by a, b, c,.... We shall also consider the partial scattering 
amplitudes corresponding to the value of / for the quasi-stationary state in 
question.|| We accordingly seek these amplitudes in the form 


l T Mab 
— (eta — 1)8gp— —ei lotio) —______ (145.1) 


D) = 
k ika 24/(kako) EAT 





(the index (/) to the constants 5, and Mab is omitted for simplicity). The 


+ The concept of the compound nucleus is due to N. Bohr (1936). | | 

t The competing reactions include also radiative capture of the incident particle, in which 
the compound nucleus goes from an excited state to its ground state with the emission of a 
y-quantum. This process is also “slow”, owing to the relatively low probability of the 
transition with emission. l 

l) We shall at first ignore the complications which arise from the spins of the particles 
involved in the process. 
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first term appears only if a = b, and represents the amplitude of potential 
elastic scattering in channel a; the constants 6, are the same as the phases 
§; which appear in (134.12). The second term in (145.1) corresponds to 
resonance processes. The form of the coefficient of the resonance factor in 
this term is chosen so as to simplify the result of applying the unitarity 
conditions (see below). 

Since we are considering scattering for a given value of the absolute 
magnitude of the orbital angular momentum, a quantity which does not 
change sign under time reversal, the reciprocity theorem (symmetry with 
respect to time reversal) is expressed simply by the symmetry of the ampli- 
tudes fap with respect to the suffixes a and b. Hence it follows that the 
coefficients Mab must also be symmetrical (Mab = Mpa). 

The unitarity conditions for the amplitudes fap are 


im fap = Ekefac fret ; (145.2) 
(a 


cf. (144.8). Substituting the expressions (145.1), we find after a straight- 
forward calculation 


M ab” Mab il 2 Mac! be” 


e o e e o 


If this equation is satisfied identically for all energies E, we must have first 
of all Mab = Map*, i.e. the quantities Vap are real. We then find 


Mav = EMacMoe, (145.3) 


i.e. the matrix of coefficients Ma must be equal to its own square. 

The real symmetrical matrix Ma can be brought to diagonal form by a 
suitable orthogonal linear transformation U. Denoting the diagonal elements 
(eigenvalues) of the matrix by M@), we can write this transformation in the 
form 


2 Uaa UgoMas = Map 


a,b 
where the transformation coefficients satisfy the orthogonality relations 


LU acUge = 5,5. (145.4) 


Conversely 


Map = È UaaUarM@), (145.5) 
The relations (145.3) give the conditions Yœ = (W)? for the eigen- 
values MM‘), so that these must be zero or unity. If only one of the M©) is 


different from zero (say M® = 1), then (145.5) gives 


Mab = UiaU ro, (145.6) 
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i.e. all the matrix elements Mab are expressed in terms of the set of quantities 
Uia, a = 1, 2,.... If several of the M% are non-zero, then the elements 
Map are sums expressed in terms of several sets of quantities Uia, Uza, ... , 
these quantities being related only by the orthogonality relations and other- 
wise independent. This case would correspond to accidental degeneracy, 
where several different quasi-stationary states of the compound nucleus 
correspond to the same quasi-discrete energy level.t Ignoring these un- 
important cases, i.e. considering non-degenerate levels, we therefore conclude 
that the matrix elements Map are products of quantities each depending on 
the number of only one channel. 
With the notation 


|Uia| = V(Po/T), 
we can write formula (145.6) as 
Map = + V(t o) T; (145.7) 


the sign of Mab depends on those of Uia and Ui», and remains indeterminate. 
On account of the equation XU ;,Uj, = 1, the quantities Tg thus defined 
satisfy the relation Í 


Imer. (145.8) 
a 


They are called the partial widths of the various channels. Formulae (145.1), 
(145.7) and (145.8) give the required general form of the scattering amplitudes. 
Let us now rewrite the final formulae, taking some definite channel as the 
input channel.t The partial width of this channel will be denoted by Te 
(the elastic width) and the widths of channels corresponding to various reac- 
tions by Izi, Tye, ... - 
The total elastic scattering amplitude is 





ee, E 
gtp 145.9 
2k E-—Ey+hT EEE); ae? 


fA) = fO"(8)— 
where k is the wave number of the incident particle and f© the potential scat- 
tering amplitude. This formula differs from the expression (134.12) in that 
T in the numerator of the resonance term is replaced by the smaller quantity 
Ie. 
The amplitudes of inelastic processes are, as already mentioned, of purely 
resonance type. The differential cross-sections are 


2 PD ra 
dora = CED ese [Pi{cos 6)]? do, (145. 10) 
4k2 (E—Epo)?+31? 


t This ts particularly clear in the case where all the M‘ = 1. It follows from (145.4) and 
(145.5) that then Map = ap, 1-e. there are no transitions between different chsnnels. In 
other words, this case would correspond to a number of independent quasi-discrete states, 
each occurring in elastic scattering in one channel. 

t These formulae were first obtained by G. Breit and E. Wigner (1936). 
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and the integral cross-sections are 


T R È a 
e=- Olr a 


——— 145.11 
k? (E— Eo)? +4112 ( ) 
The total cross-section for all possible inelastic processes is 
T Pry 
or = (21+ 1) — (145.12) 


k2 (E—E)2-+412 ’ 


where I, = I-T is the total inelastic width of the level. 

It is also of interest to know the value of the reaction cross-section inte- 
grated over the range of energy near the resonance value E = Ep. Since op 
decreases rapidly away from the resonance, the integration with respect to 
E— Eo can be extended from — œ to +00, giving 





2] 2r? Lol; 3 


In the scattering of slow neutrons (for which the wavelength is large 
compared with the dimensions of the nucleus), only s-wave scattering is 
important, and the potential scattering amplitude is a real constant — a. 
Then (134.14) becomes 


Fe 


b= -e ERIEN 


(145.14) 


The total elastic scattering cross-section is 


Te +4akr (E— E, 
E T eSaeaneey (145.15) 
kèo è (E—Ey)?4+412 


The term 4ra? may be called the potential scattering cross-section. We see 
that in the resonance region there is interference between the potential 
scattering and the resonance scattering. The amplitude « can be negligible 
only in the immediate neighbourhood of the level (E— Eo ~ T) (we recall 


that | «k | < 1), and the formula for the slow neutron elastic scattering cross- 
section then becomes 


7 Pe? 
= e EEJ AT (145.16) 
The total cross-section for elastic and inelastic scattering is 
| ee = O (145.17) 
ke (E— Eo)? +4F2 


When potential scattering is negligible, the cross-sections Ce, Cra can be 
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put in the form 
Ge = ol/T, Cra = ol rall. 


The quantity o; is the sum of the cross-sections for all possible resonance 
processes, and may be regarded as the cross-section for the formation of the 
compound nucleus. The cross-sections for the various elastic and inelastic 
processes are obtained by multiplying o; by the relative probabilities of 
particular types of disintegration of the compound nucleus, which are given 
by the ratios of the corresponding partial widths to the total width of the 
level. The possibility of this representation of the cross-sections is the result 
of the factorization of the coefficients Ma» in the numerators of the scattering 
amplitudes. It corresponds to the physical picture of the collision process 
as occurring in two stages: the formation of the compound nucleus in a 
certain quasi-stationary state, and its disintegration through one or another 
channel.t 

As already mentioned in §134, the range of applicability of the formulae 
considered here is limited only by the requirement that the difference 
| E— Eo | should be small compared with the distance D between neighbour- 
ing quasi-discrete levels of the compound nucleus (with equal values of the 
angular momentum). It was also mentioned, however, that the formulae 
as written do not allow the passage to the limit Æ —> 0, which is relevant if 
the value £ = 0 lies in the resonance region. In this case the formulae must 
be modified by replacing the energy Eo by some related constant eo, and the 
elastic width I’, by yey E; the inelastic width I, must again be regarded as 
constant (H. A. Bethe and G. Placzek 1937).[ This change causes the 
inelastic cross-section (145.12) to increase as 1/4/E when E > 0, in accordance 
with the general theory of the inelastic scattering of slow particles (§143). 

When the spins of the colliding particles are taken into account, the form- 
ulae are in general very complicated. We shall consider only the simplest, 
though important, case of the scattering of slow neutrons, when only orbital 
angular momenta / = 0 are involved in the scattering. The spin of the com- 
pound nucleus is obtained by adding the spin 7 of the target nucleus to the 
spin s = 4 of the neutron, i.e. it can take the values j = 7+ 4 (we assume that 
i Æ 0, since otherwise the formulae are unchanged). Each quasi-discrete 
level of the compound nucleus relates to a definite value of j. The reaction 
cross-section is therefore obtained by multiplying the expression (145.12) 
(with Z = 0) by the probability g(j) that the system of nucleus + neutron 
will have the necessary value of j for which there is a resonance level. 

We shall suppose that the spins of the neutrons and of the target nuclei 


+ All the above calculations have been based on a reaction of the form at+X = 6+ Y, in 
which two inıitjal particles (the nucleus and the incident particle) give rise to two particles. 
This assumption is not, however, of fundamental importance, as 1s clear from the physical 
nature of the results obtained. Formulae of the type (145.11) for the integral cross-sections 
are valid also for reactions where more than one particle leaves the nucleus. 

t It is important to note that, for inelastic processes which are possible at small energies 
(for example, radiative capture), the value E = 0 is not a threshold value. A change in the 
partial widths I ra similar to that specified for Te would be necessary for energies close to the 
threshold of the reaction in question, below which it cannot occur at all. 
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are oriented at random. There are altogether (27+ 1) (2s+1) = 2(22+ 1) 
possible orientations of the pair of spins i and s. Of these, 2j+1 correspond 
toa givenvalue j of the total angular momentum. Assuming that all orientations 
are equally probable, we find that the probability of a given value f is 


2j+1 


&(j) = Wairi 


(145.18) 


The formula for the elastic scattering cross-section must be modified 
similarly. Here it must be borne in mind that, in potential scattering, both 
values of j are involved. The factor g(j) (with j corresponding to the resonance 
level) must therefore be included in the second term in (145.15), while the 
term 4ra? must be replaced by the sum 


Feli). trl. 


The fact that resonance reactions go through the stage of formation of a 
compound nucleus in a definite quasi-stationary state leads to some general 
conclusions concerning the angular distribution of the products of these re- 
actions. Each quasi-stationary state has a certain parity (in addition to its 
other characteristics). The system of particles b+ Y formed in the dis- 
integration of the compound nucleus will therefore have the same parity. 
This means that the wave function of this system, and therefore the reaction 
amplitudes, can only be multiplied by +1 when the coordinate system is 
inverted; the squared amplitudes, i.e. the cross-sections, therefore remain 
unchanged. Inversion of the co-ordinates signifies (in the centre-of-mass 
system) the changes @ —> r —0, $ — n +ẹ¢ for the polar angle and the azimuth 
which determine the direction of scattering. The angular distribution of the 
reaction products must therefore be invariant under this change. In par- 
ticular, after averaging with respect to the directions of the spins of all the 
particles participating in the reaction, the cross-section depends only on the 
scattering angle 0, and the distribution with respect to this angle must be 
symmetrical with respect to the change 0 > 2-6, i.e. the angular distri- 
bution (in the centre-of-mass system) is symmetrical about a plane per- 
pendicular to the direction of collision of the particles.t 

Owing to the very large number of closely-packed levels of the compound 
nucleus, the detailed variation with energy of the cross-sections for various 
scattering processes is extremely complex. This complexity makes difficult, 
in particular, the discovery of any systematic changes in the properties of the 
cross-sections from one nucleus to another. It is therefore reasonable to 
consider the behaviour of the cross-sections apart from the details of the 
resonance structure, i.e. averaged over energy ranges which are large 
compared with the distances between levels. With this treatment we also 
make no distinction between the various types of inelastic process, but 


t For particles without spin, the differential reaction cross-section would be simply 
proportional to [P: (cos @)]*, and the symmetry is obvious. 
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divide the scattering only into “elastic” and “inelastic” (in the sense defined 
below).t 

To demonstrate the significance of the averaging processes, we again omit 
the complexities associated with spin, and consider the partial cross-sections 
for scattering with / = 0. 

According to formulae (142.7), 


"gi Tase 
ot a — 112, = ae =| |2}, 
(145.19) 
T 
oim TALEE a 


the elastic and inelastic scattering cross-sections, and therefore the total 
cross-section, are expressed in terms of the same quantity S (the index (0) 
is omitted for brevity). In averaging over the energy interval, the total cross- 
section, which depends linearly on S, is given in terms of the mean value of 
S by 

&, = (m/k?) .2(1— re Š); (145.20) 


the factor 1/k2, which varies only slowly, is unaffected by the averaging. The 
averaged ‘‘elastic’’ cross-section is defined as 


G_OPt = (a/k2)| S—1]2, (145.21) 


which is not in general equal to the mean value õe. In other words, we define 
the elastic scattering by first averaging the amplitude in the outgoing wave 
Setkr/r. With this definition the elastic scattering of a wave packet leaves it 
unchanged in form; we can say that the cross-section (145.21) relates to the 
“coherent” part of the scattering. This means that the part of the elastic 
scattering which occurs through the formation of a compound nucleus is 
excluded: when a long-lived compound nucleus is formed and then dis- 
integrates, the specific features of the incident wave packet are, of course, 
lost. The “inelastic” scattering in the averaged model is now naturally 
defined as the difference Gg°Pt = 6;—6,°Pt, i.e. 


G_Pt = (r/k2)(1—|S]?). (145.22) 


This includes, therefore, not only the various inelastic processes but also that 
part of the elastic scattering which occurs with the formation of an inter- 
mediate compound nucleus. 

It is easy to see that this interpretation gives a correct account of the limit- 
ing cases, and therefore serves as a reasonable interpolation. 

In the region of low energies, where the resonances are well resolved 
(I < D), S is given near each level by the formula 


S = elte) ( 7 —-_). 
E— Epo +1 


i 3 Ì i he optical model of 
+ The following method of averaging (for proceeding to what as called t 
nuclear scattering) was proposed by V. F. Weisskopf, C. E. Porter and H. Feshbach (1954). 
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Averaging gives 
S = e261 — aT ,/D), (145.23) 


where Į, and D are the elastic width and the mean distance between the levels, 
averaged over the levels occurring in the energy range concerned; the slowly 
varying function 6 (E) may be regarded as constant in the averaging. 
Hence we find 





GqoPt = ——, (145.24) 


where small terms ~ I/D have been omitted.{ This expression in fact 
coincides with the mean value of the cross-section (145.17), which, as pre- 
viously mentioned, corresponds to the formation of a compound nucleus. 

As the excitation energy of the compound nucleus increases, the distances 
between its levels decrease, and the disintegration probabilities (and so also 
the total widths of the levels) increase, so that the levels begin to overlap 
(in which case the concept of quasi-discrete levels loses much of its signific- 
ance). The irregularities of the function S(£) are then smoothed out, so that 
the difference between the exact and the averaged functions becomes small, 
and the cross-section (145.22) is the same as o, given by (145.19). This is in 
accordance with the fact that at high energies the disintegration of the com- 
pound nucleus through the input channel is unimportant in comparison with 
the numerous other modes of disintegration possible at such energies. In 
this range, therefore, all processes which involve the formation of a com- 
pound nucleus may be regarded as inelastic. 

Thus in the averaged model the scattering is again determined by a single 
quantity S, which is now a smooth function of energy. In the optical model, 
in order to calculate this function, the scattering properties of the nucleus 
are approximated by a field of force with a complex potential. The imaginary 
part of the potential has the result that absorption of particles occurs as well 
as elastic scattering. This absorption, the cross-section for which is given by 
the expression (145.22), is identified with “inelastic” scattering in the 
averaged model. 


§146. Interaction in the final state in reactions 


The interaction between particles formed as a result of a reaction may have 
a considerable effect on their distribution in energy and angle. This effect 
will naturally be particularly marked when the relative velocity of the inter- 
acting particles is small. Such a phenomenon occurs, for example, in nuclear 
reactions accompanied by the emission of two or more nucleons, the effect 
here being due to the nuclear forces which act between free nucleons. 

Let po be the momentum of the centre of mass of a pair of emergent 


+ Terms arising in the region of a level owing to the presence of other levels would be of the 
same order of magnitude. 

t The results given below were obtained first by A. B. Migdal (1950) and, independentl 
by K. M. Watson (1952). pendently, 
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nucleons, and p the momentum of their relative motion. We shall suppose 
that p < po, and so the relative energy E = p2/m (m being the nucleon 
mass) is small compared with the energy Eo = po?/4m of the motion of the 
centre of mass. We also suppose that the energy Ep is large compared with the 
energy e of the level (real or virtual) belonging to the system of two nucleons. 
That is, only the relative motion of the nucleons is assumed ‘‘slow’’, while the 
nucleons themselves are ‘‘fast’’. 

The probability of reaction is proportional to the squared modulus of the 
wave function of the particles formed when they are in the “reaction zone”, 
1.€. at a distance apart which is of the order of the range a of action of nuclear 
forces (cf. the similar discussion in §143 relating to primary particles). In 
the present case our object is to determine the dependence of the reaction 
probability only on the characteristics of the relative motion of one pair of 
nucleons. It is therefore sufficient to consider only the wave function 
P(r) of this motion, so that the probability of the formation of a pair of 
nucleons with relative momentum in the range d3p is 


dwp = constant x |Pp(a)|*d8p. (146.1) 


It has been shown in §136 that, in order to find the probability that a 
system will enter, through scattering, a state with a definite direction of 
motion, we must take as the wave functions of the final state functions 
Pp which contain (at infinity) only an ingoing wave together with a plane 
wave; these functions must be normalized by the delta function of momen- 
tum. The functions y are also obtained directly (by taking the complex 
conjugate and changing the sign of p) from the functions 96", which contain 
(at infinity) outgoing spherical waves, i.e. those which correspond to the 
mutual scattering of two particles. On substitution in (146.1) this difference 
is not significant, so that p, in (146.1) may be taken to be the functions ¥{*?, 
and the problem is therefore reduced to that of the resonance scattering of 
slow particles, which has already been discussed. 

Although the actual form of the function p, in the regionr ~ ais unknown, 
in order to find the dependence of the probability on the energy E it is 
sufficient to consider this function at distances 7 > 1/k > a (where k = p/h; 
it is assumed that ka < 1), and then continue it in order of magnitude to 
distances r ~ a.t The main contribution to 4, comes from the spherical 
wave (containing the factor 1/r). This wave is an assembly of partial waves 
with various values of J, whose amplitudes are the corresponding scattering 
amplitudes. To determine the square | (a) |2 it is sufficient to consider the 
s-wave alone, since at low energies the scattering amplitudes with / # 0 are 
relatively small. According to formula (133.7) we therefore have 


ikr 
oS (146.2) 





pp ~ 


+ This procedure is permissible because in the region r < 1/k the energy E may be neglected 
in Schrédinger’s equation which determines the function Yp. The dependence of Yp on E in 
this region is therefore entirely determined by the “joining” to the function in the region 
r~ 1/k. l 





K+ik r 
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where x = +/(mle|)/h and e is the energy of the bound (or virtual) state 
of the two-nucleon system.f Substituting this expression in (146.1), we obtain 


dp 
E+|e| i 


dwp = constant x 


(146.3) 


Thus the distribution with respect to direction of the momentum (in the 
centre-of-mass system of the two nucleons) is isotropic. The distribution with 
respect to energy of the relative motion is given by 


JV EAE 
E+|e| 


We see that the interaction of the nucleons leads to the appearance of a 
maximum in the distribution in the range of small Æ, at E ~ | ¢ |.t 

In the laboratory system of coordinates, small angles @ between the 
momenta of the two nucleons correspond to small values of the relative 
momentum (p < fo). Thus in this system an angular correlation between the 
directions of emission of the nucleons corresponds to the maximum in the 
distribution with respect to E, and leads to an increased probability of small 
values of 0. 


Let pı and pe be the momenta of the nucleons in the laboratory system. 
Then 


dwg = constant x 


(146.4) 


Po = pı +P2, p=4(pe—pi) 


(the reduced mass of two equal particles is 4m). The vector product of these 
equations gives po x p = pı x pe, and so if p < po we have 


pof, = pipesin O = $po?0, 


or 0 = 4p,/po, where p, is the component of the vector p transverse with 
respect to the direction of po, and @ is the small angle between the directions 
of pı and pe. Rewriting formula (146.3) in the form 


2rp dp dp i 
(p,2+p,2)/m + |e| 


and integrating with respect to p,, we find the probability distribution as a 
function of the angle 6. Owing to the rapid convergence of the integral, the 
integration can be extended from — œ to + œ, and the final result is 


8d 
dwg = constant x — (146.5) 


v (62 + 4|e|/Eo) 


t We are here considering an np pair with parallel or antiparallel spins, or an m7 pair with 
antiparallel spins. For a pp pair the situation is complicated by the Coulomb repulsion, and 
this case must be treated by means of the theory given in §138. i 

t Strictly speaking, the constant coefficients in formulae (146.3) and (146.4) may also 
depend on E through the remaining parts of the wave function for the whole system of reaction 
products. This dependence is only slight, however: the coefficient varies appreciably, as a 
function of E, only over the whole energy range (~ E,) available to the nucleon Sdra arihe 
reaction considered. Thus this dependence may be neglected, as regards the distribution 


in the range E < Ev, in comparison with the strong dependence given by formula (146.4). 


dwp = constant x 
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The angular distribution relative to the solid angle element do = 276 d0 has 
a maximum at @ ~ 4/(| e |/Eo). 


§147. Behaviour of cross-sections near the reaction threshold 


If the sum of the internal energies of the reaction products exceeds the 
corresponding sum for the original particles, the reaction has a threshold: 
it can occur only when the kinetic energy FE of the colliding particles (in 
the centre-of-mass system) is greater than a certain “threshold” value Er. 
Let us examine the nature of the energy dependence of the reaction cross- 
section near the threshold. We shall assume that the reaction produces only 
two particles (type A+B = A'+B’). 

Near the threshold, the relative velocity v’ of the particles formed is small. 
Such a reaction is the opposite of one in which the velocity of the colliding 
particles is small. The dependence of its cross-section on wv’ is therefore 
easily found by means of the principle of detailed balancing (144.13) and 
the known energy dependence of the reaction where wv’ is the velocity in the 
input channel (§143). Ina wide class of reactions where there is no Coulomb 
interaction between the particles A’ and B’ (such as nuclear reactions in 
which a slow neutron is formed), we therefore find that the reaction cross- 
section 1s proportional to v“? (1/w’), 1.¢.F 


or~ o. (147.1) 


Similarly we find the dependence of the cross-section on the energy of the 
colliding particles: the velocity v’, and therefore the reaction cross-section, 
are proportional to the square root of the difference E— Er: 


or = Ax/(E— B). (147.2) 


The scattering amplitudes in different channels are related by the unitarity 
conditions. The opening of a new channel therefore leads to the appearance 
of certain singularities in the energy dependence of the cross-sections for 
other processes also, including the elastic scattering cross-section (E. P. 
Wigner 1948; A. I. Baz’ 1957; G. Breit 1957). To elucidate the origin and 
nature of this phenomenon, let us consider the simple case where only elastic 
scattering is possible below the reaction threshold. 

Near the threshold, the particles A’ and B’ arc formed in a state with 
orbital angular momentum / = 0 (corresponding to (147.2)). If the reacting 
particles have no spin, the orbital angular momentum is conserved, and the 
system of particles 4+ B is also in the s-state. According to (142.7), the 
partial reaction cross-section for l = 0 is related to the S-matrix element 
for elastic scattering by 


n = 
6, (0) = a —|Sol?), (147.3) 


+ This result corresponds to the constant limit of the amplitude fyi as py +0 derived at the 
end of §144. The cross-section (144.4) is proportional to py. 
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where & is the wave number of the colliding particles. Equating (147.2) and 
(147.3), we find that just above the reaction threshold the modulus | So] is 
given, to within quantities of higher order than 4/(E~E;), by 


h2 
[Sol = 1-—-AV(E-E) (E> E), (147.4) 


where ky = +/(2mE;)/h, and m is the reduced mass of the particles 4 and B. 
Below the threshold we have only elastic scattering, so that 


| So| =q (E < Ei). (147.5) 


The scattering amplitude, and therefore So, must be analytic functions 
for all values of the energy. The function concerned, which takes the values 
(147.4) and (147.5) above and below the threshold, is given to the same 
accuracy by the formula 


h2 
So — an| 1 = —AV(E-E) |, (147.6) 
TT 


where do is constant; for E < E, the root becomes imaginary, and the 
modulus of the expression in the brackets differs from unity only by a quantity 
of a higher order of smallness. 

For all / 4 0 there is no inelastic scattering, so that 


Sy = et: (1 #0), (147.7) 


and in the region near the threshold the phases & must be taken equal to 
their values for E = E.t 

Substituting the values obtained for S; in (142.2), we find the following 
expression for the scattering amplitude near the reaction threshold: 


ki 
f(8,£) Fi fO) = aay eo Ey )e*t?o, ( 147.8) 


where f;(@) is the scattering amplitude for E = E;. The differential scattering 
cross-section is therefore 
do _ 
do 


| f()|2+ ŽA V/(E— E,) im { fi(@)e-2%8,} for E> E,, 


> k 
= | fe 8)? -— AV (E ~ E;) re { f(0je-20,} for E< Er. 


Writing the amplitude f; in the form | frjez, we can finally put this result in 
the form 


do _ (02 — Fe _ El) x 818 (280 — a), E> Es, 
E EO Eels 5 EE Vaz) 


t Since the functions 6;(£) are real both for E> E, and for E < Ee, 


they can be expanded 
series of integral powers of the difference E—E,. Pannen as a 
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Depending on whether the angle 259—« is in the first, second, third or 
fourth quadrant, the energy dependence of the cross-section described by 
this formula has the forms shown in Fig. 50a, b, c, d. In every case there are 
two branches lving on either side of a common vertical tangent. 
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In the integration of the expressions (147.9) with respect to o, the integrals 
of the second terms contain a non-zero contribution only from the isotropic 
part of the amplitude f,(6), the s-wave partial elastic scattering amplitude 
(e218, — 1)/2zk;. We thus obtain for the total elastic scattering cross-section 
near the threshold 
sin® do for E> Es, 
eH 2Aa eH E (147.10) 
ii vl tl) x sin 59 cos ŝo for E> Et. 


This has the form (a) or (b) in Fig. 50, for positive and negative sin ôo COs ĉo 
respectively. pa i. 
Thus the existence of a reaction threshold leads to a characteristic singu- 
larity in the energy dependence of the elastic scattering Cross-section. If 
the particles have spin, of course, the formulae are quantitatively different, 


§147 Behaviour of cross-sections near the reaction threshold 62] 


but the general nature of the effect remains the same.t If other reactions as 
well as elastic scattering are possible below the threshold, then corresponding 
singularities will appear in the cross-sections for such reactions. They all 
have a singularity at E = E, near which they are linear functions of the root 
s/(|\E— E|) with different slopes above and below the threshold. 

In nuclear reactions with emission of a positively charged particle, we 
have a case where Coulomb repulsion forces act between the reaction products 
(the particles A’ and B’). In this case the reaction cross-section, together with 
all its derivatives with respect to energy, tends exponentially to zero as v’ > 0 
(i.e. as E + E;), and there is no singularity in the cross-sections for other 
processes. 

Finally, let us consider reactions in which two oppositely charged slow 
particles are formed, so that Coulomb attraction forces act between them. 
The cross-section for such a reaction is related by the principle of detailed 
balancing to the cross-section (143.6) for the opposite reaction between two 
slow attracting particles. Thus we find that as v’ > 0 the cross-section tends 
to a constant limit: 


Cr = constant as v >0, (147.11) 


i.e. the reaction begins suddenly with a finite cross-section as the threshold 
is passed. 

We may elucidate the nature of the singularity of the elastic scattering 
cross-section near the threshold for such a reaction (A. I. Baz’ 1959). This 
cannot, however, be done directly from the known law (147.11) above the 
threshold by the simple method used previously for uncharged particles. In 
comparison with the latter case the situation is now complicated by the fact 
that the system of particles A’+ B’ has bound states in the region near the 
threshold (with E < E;,), corresponding to discrete energy levels in the 
Coulomb attraction field. These states can be formed, so far as energy is 
concerned, in a collision of particles A and B, but owing to the possibility 
of elastic scattering they are only quasi-stationary states. Their existence 
must nevertheless cause resonance effects in the elastic scattering below the 
threshold, analogous to the Breit-Wigner resonances. 

To solve the foregoing problem, let us consider the structure of the wave 
functions which describe the collision process. In accordance with the 
presence of two channels, Schrédinger’s equation for the system of inter- 
acting particles has two independent solutions finite in all configuration space. 
Let two such solutions, arbitrarily selected and arbitrarily normalized, be 
denoted by yı and %2. From these functions we can construct linear com- 
binations which describe the scattering in the case where one or other of the 
channels is the input channel. Let the channels corresponding to the pairs 
of particles A, B, A’, B’, be denoted by a and b, and let the sum ys = arpi + 
+ aeye correspond to the case of input channel a; it describes elastic Scattering 


+ For non-zero spins, the system of particles A’+ B’ in the s state may have a non-zero 
total angular momentum, and there can therefore be different orbital states of the system 
A+B. 
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of particles A and B and the reaction A+B —> A’+B’. Near the reaction 
threshold, the coefficients « and «2 depend considerably on the small momen- 
tum ky, while the arbitrarily chosen functions y% and We do not have singu- 
larities at kẹ = Q. 

At large distances, the function y must represent the sum of two terms 
corresponding to the motion of pairs of particles in the channels a and 6. 
These terms are the products of the “internal” functions of the particles and 
the wave function of their relative motion.t In channel a the latter function 
has the form R,~ — SaaRa*, and in channel b it is — SgpR,*+, where R+ and 
R- are the outgoing and ingoing waves in the corresponding channels. At 
distances ro which are large compared with the range of action of the short- 
range forces and small in comparison with 1/kp, these functions (and their 
derivatives) must join on to the values calculated from the wave function Ņ in 
the “reaction zone’. These conditions are expressed by equations of the 
form 


x141 + 2242 = [Ra = SaaRat Jr, ab) + x2b2 = [- Savky*],,, 
aya) + agag = [Ra Si SaaRat l r» aby’ + agbo = [— SabRe T ro 


where aj, a1’, bi, b1’, ... are quantities calculated from the functions yı and 
we; according to the above discussion, they may be regarded as constants 
independent of ky near the threshold. Dividing these two pairs of equations, 
we obtain two linear equations for two unknowns (aj/e2 and Saa), the coeffi- 
cients in these equations involving only one quantity which depends “‘critic- 
ally” on kp, namely the logarithmic derivative of the outgoing wave in channel 


b. We define this as 
oe i 
Ri Qa rRyt rafo 


There is no need to derive the actual solution of these equations; it is suffi- 
cient to note that the quantity Saa of interest here (which determines the 
elastic scattering amplitude) is a fractional-linear function of A. Below the 
threshold the quantity À is real, since the wave function Rpt is real, being the 
solution of a real Schrédinger’s equation with a real condition at infinity 
(decrease as e~*", where xy = 4/[2m2(E,—E)]/h). Below the threshold we 
must have |Saa| = 1, whence it follows that the fractional-linear function 
Saa(A) must have the form 


an = —_——247, (147.12) 


where ņ is a real constant and £ a complex constant. 
Let us determine the value of A as a function of the momentum hp. Since 


+ The law (147.11) holds not only for the total cross-section but also for the partial cross- 
sections with various values of Z; cf. the end of §143. The singularity discussed below there- 
fore occurs also for all the partial scattering cross-sections. Its nature is entirely evident from 
the treatment for the case l = 0 given below. The index 0 will be omitted, for simplicity, 
from the corresponding partial amplitudes. 
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Coulomb attraction forces act between the particles A and B, rRp* is the 
Coulomb wave function which is asymptotically Proportional to er at 
infinity. In a Coulomb repulsion field this function is given by the sum 
Go +1Fo, with Go and Fo as in (138.4) and (138.7). The change to an attractive 
field is effected by simultaneously reversing the signs of k and r.t} Making 
this change and calculating the logarithmic derivative (see §138), we havet 


ee zler u(i) -i 0719 


Here kp is assumed real, so that the formula pertains to the region above the 
threshold. For kp —> 0, the first term in (147.13) tends to 7, and the second 


tends to zero (see the fourth footnote to§138). Thus we have above the thresh- 
old 


A=i (E> E). (147.14) 


The passage to the region below the threshold is achieved by replacing k by ix, 
which gives from (147.13) with «> 0l 


A= —cot(x/xp) (E< Ei). (147.15) 


These formulae solve the problem under consideration. The elastic 
scattering cross-section is 


Og = mRg-*| Saa = 1|2. 
Above the threshold we have 
EF 
Saa = NREL ny (E > F); (147.16) 
1 -+ :8* 


like the reaction cross-section, the scattering cross-section is constant in this 
region. The condition |Saa| < 1 signifies that im B > 0. 
Below the threshold 


B —tan(z/xp) 


p*— tan(7/xp) 


Saa = ein 


(147.17) 


This expression has an infinite number of resonances whose density increases 


towards the point E = E;. The resonance energies are the roots of the 
expression 


Saa = — 1, l.e. re ei B — tan (7r/ro)] = 0: 





t In what follows we use Coulomb units. The change in sign of k and y corresponds 
formally to a change in sign of the Coulomb unit of length. 

} To simplify the subsequent formulae, we omit from the expression in the braces the real 
constant —log 2rọ—2C, which is independent of Ry; this amounts to an umimportant re- 
definition of the complex quantity § and the real Quantity 7 in (147.12). 

| The first term in (147.13) gives —} cot (m/«») + ži, and the expression in the braces tends 
to 47 cot(7/xy)+4i7. Here we have used the formula Wx) — Y(—x) = —r cot nx—1jx, 
which can be obtained by logarithmic differentiation of the well-known relation I'(x)I'(—x) = 
—7/x sin mx, and the limiting expression y(x) ~ log x —1/2x as x > œ. 
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nM 


Fic. 51 


they are somewhat displaced relative to the purely Coulomb levels (the roots 
of tan(z/xp) = Q) owing to the short-range forces. As the energy E approaches 
the threshold, the elastic scattering cross-section oscillates between zero and 
47/kg?, as shown diagrammatically in Fig. 51. The width of the entire sub- 
threshold region in which the resonance structure occurs is determined by 
the energy of the first Coulomb level.t 


§148. Inelastic collisions between fast electrons and atoms 


Inelastic collisions between fast electrons and atoms can be considered 
by means of the Born approximation in the same way as elastic collisions in 
§139.1 The condition for the Born approximation to be applicable is, as 
before, that the velocity of the incident electron should be large compared 
with those of the atomic electrons. The energy loss in the collision may have 
any value. If the electron loses a considerable part of its energy, the atom 
is ionized, the energy being transferred to one of its electrons. However, 
we can always regard as the scattered electron that which has the greater 
velocity after the collision; thus, if the velocity of the incident electron is 
large, that of the scattered electron is large also. 

In a collision between an electron and an atom, the coordinate system in 
which their centre of mass is at rest may, as already remarked, be identified 
with that in which the atom is at rest; this latter system will in fact be used 
below. 

An inelastic collision is accompanied by a change in the internal state of 
the atom. The atom may go from the normal state into an excited state of the 
discrete or continuous spectrum; the latter case signifies an ionization of the 


+ Another interesting case of reactions near the threshold is the ionization of an atom by an 
electron whose energy is only slightly greater than the first ionization energy of the atom. 
In these conditions the collision process may be regarded as quesi-classical, but the problem 
is greatly complicated by the presence of three charged particles in the final state. The general 
solution of this difficult problem has been given by G. H. Wannier (Physical Review 90, 
817, 1953). The probability of ionization of a neutral atom is found to be proportional to 
(E—1)®, where a = (v (91/3)—1) = 1:13 and E—J is the amount by which the electron 
energy exceeds the ionization threshold. 

t Most of the results given in §§148—-150 were obtained by H. A. Bethe (1930). 
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atom. In deriving the general formulae, we can consider these two cases 
together. 

We start (as in §126) from the general formula for the transition probability 
between states of the continuous spectrum, and apply it to the system 
consisting of the incident electron and the atom. Let p, p’ be the momenta 
of the incident electron before and after the collision, and Eo, E, the cor- 
responding energies of the atom. For the transition probability, we have 
instead of (126.9) the expression 


ee (eo Be op, Vee 
dwn = £Z] <n, p'|UIO, p> ( P+ En a (148.1) 


where the matrix element is that of the energy of interaction between the 
incident electron and the atom, 


Z 
U = Zejlr— ¥ ejr—ra]; 
aml 


here r is the radius vector of the incident electron, r, those of the atomic 
electrons; the origin is at the nucleus of the atom, and m is the mass of the 
electron. 

The wave functions Y, Yp of the electron are determined by the previous 
formulae (126.10), (126.11); then dw is the cross-section de for the collision. 
The wave functions of the atom in the initial and final states we denote 
by Yo, Yn. If the final state of the atom belongs to the discrete spectrum, 
then Wn (like Wo) is normalized to unity in the usual manner. If, on the other 
hand, the atom enters a state of the continuous spectrum, the wave function 
is normalized by the delta function of the parameters v which determine these 
States (these parameters may be, for instance, the energy of the atom, and the 
momentum components of the electron which leaves the atom in the ioniza- 
tion), The cross-sections thus obtained give the probability of a collision 
in which the atom enters states of the continuous spectrum lying in the range 
of parameters between v and v+ dr. 

Integration of (148.1) over the absolute magnitude p’ gives 


mp’ A 
dig oe Pe hlp | UJOp>|2do’, 





where p’ is determined from the law of conservation of energy: 


(p°—p)/2m = En— Ep. (148.2) 


Substituting in the matrix element the wave functions of the electron from 
(126.10), (126.1 l), we obtain 


mè? pP ; 2 
do, = 42fa ey | | [ue bn ho drd VY do, (148.3) 
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where dr = dV, dV, ... dV z is the element of configuration space of the Z 
electrons in the atom, and we omit the prime to do.t Forn = Qandp = p’, 
(148.3) becomes the formula for the elastic scattering cross-section. 

Since the functions y, and y, are orthogonal, the term in U which contains 
the interaction Ze2/r with the nucleus vanishes on integration over 7, and 
so we have for inelastic collisions 


m 


2 p e2 2 
dan = — — et -rhn *uig dtd V| do. 148.4 
G 477 2h4 p ah Ean bn* Yio o ( ) 





The integration over [V can be effected as in §139. The integral 


alto) = | a-t dV/|r—re| 


is formally the same as the Fourier component of the potential at the point r 
due to charges distributed in space with density p = 0(r—rg). Formula 
(139.1) therefore gives 


dalra) = (477/q2)e- #0. (148.5) 


Substituting this expression in (148.4), we finally obtain the following 
general expression for the inelastic collision cross-section: 


e2m\* 4k’ ai - 
fea (=) ai Ee 410>|2 do; (148.6) 


here the matrix element is taken with respect to the wave functions of the 
atom, and we have introduced, in place of the momenta, the wave vectors 
k’ = p’'/i,k = p/h. This formula gives the probability of a collision in 
which the electron is scattered into an element of solid angle do and the atom 
enters the nth excited state. The vector —/iq is the momentum given to the 
atom by the electron in the collision. 

In effecting the calculations, it 1s more convenient to refer the cross- 
section, not to the element of solid angle, but to the element dg of the absolute 
magnitudes of the vector q. The vector q is defined by q = k’ —k; for its 
absolute magnitude we have 


q2 = k®?+h'2—2kk’ cos. (148.7) 


Hence, for given k, k’, i.e. for a given loss of energy by the electron, 


qdq = kk’ sin $ dd = (kk’/27) do. (148.8) 





+ In this form, it is a general result of perturbation theory, applicable not only to collisions 
of electrons with an atom but also to any inelastic collisions of two particles, and determines 
the scattering cross-section in a system of coordinates where the centre of mass of the particles 
is at rest (m being then the reduced mass of the two particles). 
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Formula (148.6) may therefore be written 


e2\2 dg 7 
dies an( =) £4 fn) E eara] 0y]. (148.9) 
ho} @ a 
The vector q plays an important part in the following calculations. 
Let us examine more closely its relation to the scattering angle $ and to 
the energy E„,— Eo transferred in the collision. We shall see below that the 
most important collisions are those which cause scattering through small 
angles ( < 1), with a transfer of energy which is small in comparison with 
the energy E = 4mv? of the incident electron: E,,—E, < E. The difference 
k—k’ is in this case also small (k—k’ < k), and 


En— Eo = h2(k? —k'2)/2m = h?k(k — k')/m = ho(k—k’). 
Since # is small, we have from (148.7) 
g? = (hk P+ (RY, 
and finally 
q = V[{(En—£o)/fiv}? +(23)°]. (148.10) 
The minimum value of q is 
Qmin = (En — Eo)/ hv. (148.11) 


In the region of small angles we can further distinguish between different 
regions depending en the relation between the small quantities $ and 
vo/v, where vo is of the order of the velocity of the atomic electrons. If we 
consider energy transfers of the order of the energy eo of the atomic electrons 
(En— Eo ~ eo ~ mv2), then for (vo/v)!? <9<1 we have 


q = kò = (mo/h)8, (148.12) 


the first term under the radical in (148.10) can be neglected in comparison 
with the second. In this range of angles, therefore, q is independent of the 
energy transfer. For 3 <1, g may be either large or small in comparison with 
1/ag (where ao is a quantity of the order of atomic dimensions). On the same 
assumption regarding the energy transfer we have 


qao ~~ 1 for $ ~ w/v. (148.13) 


Let us now apply the general formula (148.9) for small q (qao < 1, 


i.e. $ < vo/v). In this case we can expand the exponential factors as series of 
powers of q: 


e—id. r, zZ l—iq.ra = I — igxa; 


we choose a coordinate system with the x-axis along the vector q. On 
substituting this expansion in (148.9), the terms containing 1 give zero, by 
the orthogonality of the wave functions % and Pn and we obtain 


2d 2 
dapa &\, a dAn 2E do 
mn “(5) 7 |<nldzl0> (= [<nldelO>[2£2, (148.14) 
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where dz = e È xa is the component of the dipole moment of the atom. 
We see that the cross-section (for small q) is given by the squared modulus 
of the matrix element of the dipole moment for the transition which corres- 
ponds to the change in state of the atom.t 

It may happen, however, that the matrix element of the dipole moment 
vanishes identically for the transition considered, on account of the selection 
rules (a forbidden transition). Then the expansion of e~*4a must be continued 
to the next term, and we obtain 


ZN 2 
ns 2n( £) aE xq2)|0>|29 dg. (148.15) 
arel a 


Let us now consider the opposite limiting case of large q (qa > 1). If q 
is large, this means that the atom receives a momentum which is large com- 
pared with the original intrinsic momentum of the atomic electrons. It is 
evident from physical considerations that, in this case, we can regard the 
atomic electrons as free, and the collision with the atom as an elastic col- 
lision between the incident electron and the atomic electrons, the latter 
being originally at rest. This can also be seen from the general formula 
(148.9). For large g, the integrand in the matrix element contains rapidly 
oscillating factors e—??*<, and the integral is almost zero if Yn does not contain 
a similar factor. Such a function x» corresponds to an ionized atom, with the 
electron emitted from it with momentum —/q = p- p' given by the law of 
conservation of momentum, as it would be in a collision of two free electrons. 

In a collision with a large transfer of momentum, the incident electron 
and the atomic electron may have final velocities that are comparable in 
magnitude. The exchange effect arising from the identity of the colliding 
particles therefore becomes important, although it was not taken into account 
in the general formula (148.9). The scattering cross-section for fast electrons 
when exchange is allowed for is given by formula (137.9); this formula relates 
to a coordinate system in which one of the electrons is at rest before the 
collision. For a fast electron the cosine in the last term in (137.9) may be put 
equal to unity. Multiplying by the number of electrons in the atom, Z, we 
obtain the cross-section for the collision of an electron with an atom, in the 
form 


2\2 1 1 
do = 42( : = + sea |e do. (148.16) 
cos*> 


mu*/ Lsint9 cos#S sin?$ 








In this formula it is convenient to express the scattering angle in terms of 
the energy which the electrons have after the collision. As is well known, 
when a particle of energy E = mo? collides with one of the same mass at 


+ The cross-section don, summed over all directions of the angular momentum of the atom 
in the final state and averaged over the directions of the angular momentum in the initial state, 
is what is usually of physical interest. After this summation and averaging, the square 
|<n}dz|O>|* is independent of the direction of the x-axis. 
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rest, the energy of the particles after the collision is 
€ = Esin®$, E—e = E cos2$. 


In order to find the cross-section referred to the interval de, we express do in 
terms of de by the relation cos $ do = 27 sin $ cos 9d $ = (7/E) de. Sub- 
stituting in (148.16), we obtain the final formula 


d Ze! ] 1 1 j i? 
k Gare e(E—e) E (laa? 





If one of the energies e and E—e is small compared with the other, only one 
of the three terms in this formula (the first or the second) is important. This 
is as it should be, since, for a great difference between the energies of the 
two electrons, the exchange effect becomes insignificant, and we then return 
to the familiar Rutherford’s formula.t 

The integration of the differential cross-section over all angles (or, what 
is the same thing, over q) gives the total cross-section op for a collision in which 
the atom is excited to the state in question. The dependence of opn on the 
velocity of the incident electron is closely related to the existence or otherwise 
of the matrix element, for the corresponding transition, of the dipole moment 
of the atom. Let us first suppose that this matrix element is not zero. Then, 
for small q, doy is given by formula (148.14), and we see that, as g diminishes, 
the integral over q diverges logarithmically. In tue region of large q, on the 
other hand, the cross-section (for a given energy transfer En — Eg) decreases 
exponentially as q increases, because of the presence (already pointed out) of a 
rapidly oscillating factor in the integrand of the matrix element in (148.9). 
Thus the region of small q plays the principal part in the integral over q, and 
we can restrict ourselves to an integration from the minimum value (148.11) 
to some value of the order of 1/ap. 

As a result we obtain 


On = 8n(e/hv)2|<n|dz|0)|2 log( Bnvh/e2), (148.18) 


where £,, is a dimensionless constant, which cannot be calculated in a general 
form. t 


If, on the other hand, the matrix element of the dipole moment vanishes for 
the transition in question, the integral over q converges rapidly both for 
small q (as we see from (148.15)) and for large q. The most important range 
in the integral is in this case g ~ 1/a,. No general quantitative formula 


t For 


4 a collision of a positron with an atom there 1s no exchange effect, and Rutherford’s 
ormula 


do, = (m Zeti E) dei e? 
holds for all g > l/a. 

t We suppose that En — E, is of the order of the energy €, of the atomic electrons. For larger 
energy transfers (En—E,~ E > e,), the formulae (148.14), (148.18) are still inapplicable, 
since the matrix element of the dipole moment becomes very small, and it is not possible to 
take only the frst term of the expansion in powers of q. 
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can be obtained, and we can deduce only that cy is inversely proportional to 
the square of the velocity: 


an = constant/?. (148.19) 


This follows at once from the general formula (148.9), according to which 
do, is proportional to 1/v? for q ~ 1/ap. 

Let us determine the cross-section do, for inelastic scattering into a 
given element of solid angle regardless of the state entered by the atom. 
To do this, we have to sum the expression (148.9) for all n # 0, i.e. over all 
the states of the atom (of both the discrete and the continuous spectrum) 
except the normal state. We omit from consideration the ranges of large 
and small angles, and suppose that 1 > 9 > (vo/v)?. Then, by (148.12), 
q is independent of the amount of energy transferred.t 

The latter circumstance makes it easy to calculate the total inelastic collision 
cross-section, i.e. the sum 


e\? ERE $ 
> don = an( =) > [nl e~ 103 


n0 n#0 


( 2e? j ee eara] 0 E, (148.20) 


mv? 


il 


do, 


n#O0O 


To do so, we note that, for any quantity f, we have by the multiplication rule 
for matrices 


E | fon|? = E fonfon* = E fon(f*)no = (Sf *)oo- 


The summation here is over all n, including n = 0. Hence 


nzolfon|? = Z LFonl*— [fool = (ff*)oo—| fool?. (148.21) 


Applying this relation for f = X e~***a, we have 


2e2 \2 gua pA do 
dor = (25) IF eD eo (14822 


mu? 


where ¢...) denotes averaging with respect to the normal state of the atom 
(i.e. taking the diagonal matrix element 00). The mean value <È e-**"e) is, 
by definition, the atomic form factor F(q) for the atom in the normal state. 
In the first term in thé braces we can write 


2 


& e~ta.fo 5 Z+ A RERE: 
£ 


Oml 











+ The summation in (148.9) is taken over states with En — Eo > €o also, for which (148.12) 
does not hold. However, the effective cross-section for transitions with a large energy transfer 
is relatively small, and these terms in the sum are unimportant. The condition < 1 ıs 
imposed so that the exchange effects need not be taken into account. 
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Thus we find the general formula 
Ze? \? do 
2 iq(r_ —r 


This formula is much simplified for small g, when we can expand in powers 
of g (vo/v < gay < 1, corresponding to angles (v/v)? < > < vov). Instead 
of effecting the expansion from formula (148.23), it is more convenient to 
sum again over n, using for do, the expression (148.14). Summing with 
the aid of the relation (148.21) with f = dz, and recalling that ¿dzY = 0, we 
have 


do, = (2e/hiv)2<d,2)do/9?2. (148.24) 


It is of interest to compare this expression with the cross-section (139.5) 
for elastic scattering through small angles; whereas the latter is independent 
of $, the cross-section for inelastic scattering into the solid angle element do 
increases as 1/92 when 9 decreases. 

For angles $ such that 1 >$ > v/v (so that ga, > 1), the second and 


third terms in the braces in (148.23) are small, and we have simply 
dor = Z(2e2/mv?)2do/44, (148.25) 


i.e. Rutherford scattering from the Z atomic electrons (without allowance for 
exchange). We recall that, for elastic scattering, we had the result (139.6), 
which is proportional to Z? and not to Z. 

Finally, integrating over angles, we have the total cross-section a, for 
inelastic scattering at all angles and with any excitation of the atom. In an 
exactly similar manner to the calculation of on (148.18), we obtain 


or = Br(ejħv)?<dr2ylog( Behle?) (148.26) 


PROBLEMSt 
PROBLEM 1. Determine the angular distribution for 1 ẹ $3 v? from the inelastic 
scattering of fast electrons by a hydrogen atom (in the normal state). 


SoLutTion. For the hydrogen atom, the third term in the braces in (148.23) vanishes, 


ior the atomic form factor F(q) has been calculated in §139, Problem. Substituting, we 
n 
4 (147292/4)4_ 
poh see a1 
vigi (1+0292/4)4 


ProBLEM 2. Determine the differential cross-section for collisions of electrons with a 
hydrogen atom in the normal state, the latter being excited to the nth level of the discrete 
spectrum (where n is the principal quantum number). 


SOLUTION. ‘The matrix elements are conveniently calculated in parabolic coordinates. 
We take the z-axis in the direction of the vector q; then 


ear = gil? — etigté— n) 


+ We use atomic units tn all the Problems. 
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The wave function of the normal state is 


The matrix elements are non-zero only for transitions to states with m = 0. The wave 
functions of these states are the functions 


Pnn, = (1/4/aen®)e-HE+ 9 F(— n, 1, €/n)F(—ne, 1, /n) 


(n = n+m +1). The required matrix elements are the integrals 





<nyng0let*|000> = ff eio- 2oooyn no in ddn. 
0 6 


The integration is effected by means of the formulae of §f in the Mathematical Appendices. 
The result is 


| <nyn20|et-"|000>|? = 2 alin = no)? + (gn)?]. 


All states with the same m +n, = n—1 have the same energy. Summing over all possible 
values of n, —n, for the given n, and substituting the result in (148.9), we obtain the required 
cross-section: 


((n— 1)? +(gn)?]" dg 


1 
deg 20s a ee e a a 
n "on [3( ) (9 ) TLDs Gn 9 
PROBLEM 3. Determine the total cross-section for the excitation of the first excited state 
of the hydrogen atom. 


SOLUTION. We have to integrate 
J dg 


dog = 28x— ——_——— 
o? 9(g?-+9/4)8 


over all g from. @min = (E,:—E,)/v = 3/80 to gmax = 2v, only the terms of the highest degree 
in v being retained. The integration is elementary, and the result ist 


218, 257 ax i 
o = ——| log (4v) — — | = — 0:555 log —. 
í | pe a 02 °F 6-50 


PROBLEM 4. Determine the cross-section for the ionization of a hydrogen atom (in the 
normal state), with the emission of a secondary electron in a given direction; the energy of the 
secondary electron is small in comparison with that of the primary, and so exchange effects 
are unimportant (H. S. W. Massey and C. B. O. Mohr 1933). 


SOLUTION. The wave function of the atom in the initial state is tp = e T/v n. In the 
final state, the atom is ionized, and the secondary electron emitted from it has a wave vector 
which we denote by x (and energy «€ = $x?). This state is described by a function $f). 





+ The cross-section can also be calculated for arbitrary 7. By numerical calculation, we can 
obtain also the total cross-séction for inelastic scattering by a hydrogen atom: 


Or = Ba log(z2/0- 160) 
a2 


This includes the following contributions from collisions in which states of the discrete 
spectrum are excited, and from those in which the atom is ionized: 


or = $Z log(x?/0-160). 
2 


dio = “Z x 0:285 log(v?/0-012). 
2) 


io 
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(136.9), in which the outgoing part consists (at infinity) only of a plane wave propagated in 
the direction of x. The function 9,’ is normalized by the delta function in «/27-space; 
hence the cross-section calculated from it will relate to d3«/(2m)? or to x2 dx do,/(27)3, where 
dox is an element of solid angle about the direction of the secondary electron. Thus 


4k’ x? 


“= Oe 


|< wfe-* 105]? dodox dx, 


where do is an element of solid angle about the direction of the scattered electron, and 


e- -i]) 


ni2 


Clemo) = fu teeny ay = 


a dV 
T= l-= [eter RG 1; i(xr-+ x.r)) —| . 
À r jel 


We effect the integration in parabolic coordinates, with the z-axis in the direction of x and 
the angle ¢ measured from the (q, x) plane: 


=j a — hiof(f— Sy-+t = +n)—4rn(E— 
| $ al | Jor 419(E—7)cos y-+194/(En)sin y Ki n 


x Flific, 1, ix) dpdédn | 


A=1 


where y is the angle between x and q. The integration over ¢ and y is easily performed by sub- 
stituting yy cos¢ = u, Vy sing = v, which gives 


2x La 
0 


l zÍ [=# sin?y +A°+(« +g cosy)? ! aed 
exp} ——____—-— a ee 


2[i(«-+ 9 cosy)—A] i(x +g cosy)—Ad a-i 


The integral here is found from the formula (f.3) with y = 1, n = 0. The subsequent cal- 
culations, though lengthy, are elementary, and give as a result the following expression for the 
cross-section: 


28k’ x(q? + Zgx cos y+(x2+1) cos®y] 


Keiran- iqa- do do, dw. 


do 


The integration over all angles of emission of the secondary electron is elementary, and gives 
the distribution of scattering over directions, for a given energy $x? of the emitted electron: 
20k’ x (++ x2) Je 42 /xktan— 2x (q1 -xiI 
ae PEGEA 
ka? [++i Pilg pRO e2) 


dx. 





For q > 1, this expression has a sharp maximum at x œ q; near the maximum, 


25 dx do 
Imri [1+ (g9— KE 





Integrating over o, with do = 2mq dg/k? ~ (27x/k®)d(g—«), we obtain the expression 
8 dx/k*«?; this is the same as the first term in formula (148.17), as it should be. 


§149. The effective retardation 


In applications of collision theory, the calculation of the mean energy 
lost by a colliding particle is of great importance. This energy loss is con- 
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veniently characterized by the quantity 


dx = X (En—Eo) don, (149.1) 


which we shall call the (differential) effective retardation; the summation 
is taken, of course, over states of both the discrete and the continuous spec- 
trum. dx relates to scattering into a given element of solid angle.t 

The general formula for the effective retardation of fast electrons is 


ene ç ee dg 
de = Bai —}\ 2 (En — Eo)| <n È e~s a0)? =, (149.2) 
hv] ” a q? 


where don has been taken from (148.9). As in the derivation of (148.23), we 
exclude from consideration the region of very small angles, and suppose that 
1 > > (vo/v)*?. Then q is independent of the amount of energy transferred, 
and the sum over n can therefore be calculated in a general form. 

This is done by means of a summation theorem derived as follows. The 
matrix elements of some quantity f, a function of the coordinates, and of 
its derivative f with respect to time are related by 


(fon = —(i/h)(En— Eo) fon. (149.3) 


Hence we have 
2 (En— Eo)| fon|? = 2 (En— Eo) fonfon* 


= E (En—Eo)fon(f*)no = ih E (flon(f*)no = PS Hoo: 


The wave functions of the stationary states of the atom can be taken real. 
Then the matrix elements of the function f of the coordinates are related 
by fon = fno, and for the matrix elements (149.3) we accordingly have 
(Pon = —(f)no- Thus we can also write the sum in question as 


—ih D(f *)on(f)no = —th(ftf oo. 
Taking half the sum of these two equations, we have the required theorem: 


E (En—Eo)| fon}? = kiA f+ —S P )oo. (149.4) 


We apply it to the quantity 


f = D ette, 


According to (19.2), its derivative with respect to time is represented by the 
operator 


f = — (hi[2m) E [e*4-*6(q. Ya) +(q« Vale“-*]. 
A direct calculation gives 


Fff} = —(ti|m)_@2Z. 


+ If an electron is passing through a gas, the scattering at various atoms 1s independent, and 
Ndx (where N is the number of gas atoms per unit volume) is the energy Jost by the electron 
in unit path by collisions in which it deviates into the given element of solid angle. 
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Substituting in (149.4), we obtain the formula 





2m 
2, jae Pl Cri etal? = Z, (149.5) 


which effects the summation required.t+ 
Thus we find for the differential effective retardation the formula 
Zedg 2Zetdo 
Arr ts 


dea = 


mu q mèg 





(149.6) 


The range of applicability of this formula is given by the inequality 
(v/v)? <9 < 1, i.e. up/v < ang < vjvo. 


Next, let us determine the total effective retardation «(q,) for all collisions 
in which the transfer of momentum does not exceed some value q, such 
that voju < agg, < vjvo: 

qı 


(a) = E | (En—Eo) don; (149.7) 


{min 


min IS given by (148.11). The integration and summation signs cannot be 
transposed, since min depends on n. 

We divide the range of integration into two parts, from Guin tO go and 
from go to qı, where gy is some value of q such that v,/uv < Joo < 1. Then, 
over the whole range of integration from gin to qo we can use for do, 
the expression (148.14): 


Klao) = 8a 7E) | nldal0)*En—Ea) f 4 


{min 


whence 





22 Ag YF qov 
Kao) = Bn) |dn|del0>)%En—Fa) loge (149.8) 


In the range from ge to q, on the other hand, we can first sum over n, which 
gives the expression (149.6) for dx, and then on integrating over g we have 


«(91)—«(go) = 4n( Ze4/mv?) log(gi/go). (149.9) 


To transform the above expressions, we use the summation theorem 
obtained from formula (149.4) by putting there 


Seles ae el UE fa. 


t In deriving this relation we have nowhere used the fact that the state denoted by the 
suffix 0 is the normal state of the atom. The relation therefore holds for any initial state. 
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Commuting f* and f gives (in the present case, f7 is the same asf) f ft -f-f = 
—ihZ|m, so thatt 


2 Non = ~ (2m/e2h?)(E, — Eo)|<n|dz|0>|2 = Z. (149.10) 


The quantities Non are called the oscillator strengths for the corresponding 
transitions. 
We introduce some mean energy J of the atom, defined by the formula 


log I = * Non log(En—Eo)/% Non 


= (1/Z) & Non log(En—4o). (149.11) 


Then, using (149.10), we can rewrite formula (149.8) in the form «(go) = 
(427 Ze4/mv?) log(gohv/f). Adding this to (149.9), we have finally 


(qi) = (4r Ze4/mv?) log(qihv/1). (149.12) 


Only one constant characterizing the atom concerned appears in this 
formula.] 

Expressing q, in terms of the scattering angle 3, by means of q) = mv9d,/h, 
we obtain the effective retardation in scattering through all angles $< 3: 


(91) = (40 Ze4/mv?) log(mv*9y/T). (149.13) 


If qao > 1 (i.e. 9, > vov), we can express x as a function of the greatest 
amount of energy that can be transferred from the incident electron to 
the atom. We have shown in the previous section that, for gay > 1, the atom 
is ionized, almost all the momentum fq and energy being given to one atomic 
electron. Hence åq and e are related by being the momentum and energy 
of an electron, i.e. by e = A2q2/2m. Substituting in (149.12) q1? = 2me,[h2, 
we obtain the effective retardation in collisions where the energy transfer is 
e< &: 


r(e) = (277 Zemo?) log(2meyv*/I?). (149.14) 
In conclusion, we may make the following remark. The energy levels of the 


discrete spectrum of an atom mainly involve excitations of a single 
(outer) electron; i excitation of even two electrons usually requires an 





‘+ The remark made concerning (149.5) applies to this relation also. 

t For hydrogen, J = 0-55me*/h? = 14-9 eV. For heavy atoms we should expect to get good 
accuracy on calculating the constant I by the Thomas—Fermi method. It is easy to establish 
how the values of J thus calculated will depend on Z. In the quasi-classical case, the eigen- 
frequencies of the system of particles correspond to the differences of the energy levels. The 
mean eigenfrequency of the atom 1s of the order of vp/ao; hence we can deduce that J ~ Řvjao. 
The velocities of the atomic electrons in the Thomas—Fermi model depend on Z as Zy 
while the dimensions of the atom vary as Z-1/3, Thus we find that J should be proportional 
to Z: 1 = constant x Z. From experimental results it can be found that the constant is of the 
order of magnitude of 10 eV. 
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energy sufficient to ionize the atom. Hence, in the sum of oscillator strengths, 
the transitions to states of the discrete spectrum form only a part, of the 
order of unity, while those which involve ionization form a part of the order 
of Z. Hence it follows that the main part in retardation (by heavy atoms) 
is played by those collisions which are accompanied by ionization. 


PROBLEM 


Determine the total effective retardation of an electron by a hydrogen atom (J = 0-55 
atomic units); for large energy transfers, the faster of the two colliding electrons is taken to 
be the primary. 


SoLuTION. When the primary and secondary electrons have comparable energies after 
the collision, the exchange effect must be taken into account. Hence, for retardation with 
an energy transfer between some value e, (1 < e, < v*) and the maximum value €nmar 
= $E = łv? (by our definition of the primary electron), we must use the effective cross- 
section (148.17): 


1 J 
— ] de 
(E—e)* «(E—e) 








sE 
T 1 
Henn) eta) = 5 | dat 


a 
Z log(E/8e1)+ 1]. 
E 

Adding this to (149.14), we obtaint 


9 


m [ev ] ais ve 
xk = — JO — z= — — 
z 8| 37V 89 |= z Egoa 


in atomic units. 


§150. Inelastic collisions between heavy particles and atoms 


The condition for the Born approximation to be applicable to collisions 
between heavy particles and atoms, expressed in terms of the velocity of a 
particle, remains the same as for electrons: 


U> vo. 


This follows immediately from the general condition (126.2) for perturbation 
theory to be applicable (Ua,/fiv < 1), if we notice that the mass of the particle 
does not appear there, while Ua,/h is of the order of magnitude of the velocity 
of the atomic electrons. 

In a system of coordinates in which the centre of mass of the atom and 
the particle is at rest, the cross-section is given by the general formula 
(148.3), in which m is now the reduced mass of the particle and the atom. It is 
more convenient, however, to consider the collision in a system of co- 
ordinates in which the scattering atom is at rest before the collision. To do 
spade ee Ea here is no exchange effect, and the 
x = (4rlv?) log(v?/0-55). 
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this, we start from formula (148.1); in a system of coordinates in which the 
atom is at rest before the collision, the argument of the delta function which 
expresses the law of conservation of energy 1s of the form 


$p"?/M—3p*/M+3(p —p)*/Mat+En—Lo, (150.1) 


where M is the mass of the incident particle and M, that of the atom. The 
third term is the kinetic recoil energy of the atom (and could be entirely 
neglected when considering a collision between an atom and an electron). 

For a collision of a fast heavy particle with an atom, the change in the 
momentum of the particle is almost always small in comparison with its 
original momentum. If this condition holds, we can neglect the recoil energy 
of the atom in the argument of the delta function, and we then arrive at 
exactly the same formula (148.3), except that m in the latter must be replaced 
by the mass M of the incident particle (and not by the reduced mass of the 
particle and the atom). Bearing in mind that the transfer of momentum is 
supposed small in comparison with the original momentum, we put p ® p’; 
then the cross-section in a system of coordinates in which the atom 1s at rest 
before the collision 1s 


don = (M2/4n2hi)| Í Í Uetat n*ho drd V}? do. (150.2) 


Taking into account the fact that the charge on the particle may differ from 
that on the electron, we write ze? in place of e?, where ze is the charge on 
the incident particle. The general formula for inelastic scattering, written 


in the form (148.9): 
2\ 2 
do, = 8n(—— | <n|d -inxg1) 204, (150.3) 
hv a q’ 


does not contain the mass of the particle. Hence it follows that all the formulae 
derived from it remain applicable to collisions with heavy particles, provided 
that these formulae are expressed in terms of v and q. 

It is easy to see how the formulae must be modified when they are expressed 
in terms of the scattering angle (the angle of deviation of the heavy particle 
on colliding with the atom). To see this, we notice first of all that the angle 
$ is always small in an inelastic collision with a heavy particle. For, when 
the momentum transfer is large (compared with the momenta of the atomic 
electrons), we can regard the inelastic collision with the atom as an elastic 
collision with free electrons; when a heavy particle collides with a light one 
(the electron), however, the heavy particle hardly deviates at all. In other 
words, the transfer of momentum from the heavy particle to the atom is 
small in comparison with the original momentum of the particle; an excep- 
tion is formed by elastic scattering through large angles, but this is extremely 
improbable. 

Thus, over the whole range of angles, we can put 


q = W{[(En—Eo)/v}? + (A409)?}/h, (150.4) 
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which in practice reduces to 
gh x Mv (150.5) 


everywhere except for very small angles. On the other hand, when consider- 
ing the collisions of electrons with an atom, we had (for small angles) 


q = Vi[(En— Eoo] + (medy. 


Hence we can deduce that the formulae which we obtained for collisions be- 
tween electrons and atoms, if expressed in terms of the velocity and the angle 
of deviation, become formulae for the collision of heavy particles if we 
everywhere make the substitution 


9 -> Mjm (150.6) 


(including the element of solid angle do = 2r sin 9 d3 ~ 273 dè), the 
velocity of the incident particle remaining unchanged. Qualitatively, this 
means that the whole picture of small-angle scattering is (for a given velocity) 
compressed in the ratio m/ M. 

The rules obtained above relate also to elastic scattering through small 
angles. Making the transformation (150.6) in formula (139.4) with 3 < 1, 
we have the cross-section 


doe = 8n(ze2/Mv®)?[ Z— F(Mv SAYE d 9/9. (150.7) 


The elastic scattering of heavy particles through angles 3 ~ 1 reduces to 
Rutherford scattering at the nucleus of the atom. 

Inelastic scattering in which the atom is ionized with a large transfer of 
momentum requires special consideration. Unlike the situation for ionization 
by an electron, there are of course no exchange effects. For heavy particles 
it is characteristic that a large momentum transfer (ga, > 1) does not mean 
a deviation through a large angle; always remains small. The cross-section 
for ionization with the emission of an electron of energy between e and e + de 
is found immediately from formula (148.25), which we write in the form 


doy = 87(ze?/hv)?Z dq, 


putting f2q?/2m = e (the whole of the momentum Aq is given to a single 
atomic electron). This gives 


do, = (27 Z2%e4/mv?) de/e?. (150.8) 


In collisions of heavy particles with atoms, the total cross-section and effec- 
tive retardation are of particular interest. The total inelastic scattering cross- 
section is given by the previous formula (148.26). The total effective retarda- 
tion is obtained by substituting the maximum possible momentum transfer 
Gmax in place of q1 in (149.12). qmax is easily expressed in terms of the velocity of 


the particle as follows, Since even /igmax is small compared with the ori Boal 
momentum Mv of the particle, the change in its energy is related to the 


change in momentum by AE = v.fq. On the other hand, for a large 
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momentum transfer nearly all this energy is given to one atomic electron, so 
that we can write 


e = h2g?/2m = fiv . q < hg. 
Hence we have fig < 2mo, Le. 
Poa = 2m0, Emax = 2mv?, (150.9) 


We may notice that the maximum angle of deviation of the particle in an 
inelastic collision 1s 


Sinax = Agmax/Mu = 2m|M. 


Substituting (150.9) in (149.12), we obtain the total effective retardation 
of a heavy particle: 


k = (4r Z2%e4/mv*) log(2mv2/I). (150.10) 


§151. Scattering of neutrons 


In various physical problems of collision theory it is necessary to consider 
how the scattering process is affected by the motion of the scattering centres. 
Under certain conditions we can solve such problems by means of a type of 
perturbation theory devised by E. Fermi (1936), even if perturbation theory 
is not applicable to scattering by each centre individually. ‘These problems 
include, in particular, that of the scattering of slow neutrons by a system of 
atoms, such as a molecule. We shall refer to this specific problem. 

Neutrons are scarcely scattered by electrons, so that practically all the 
scattering takes place at the nuclei.T We shall suppose that the amplitude for 
scattering by an individual nucleus is small in comparison with the distances 
between the atoms. Then the amplitude of the wave scattered by each 
nucleus in the molecule is small even at the positions of the other nuclei. 
Under these conditions the amplitude for scattering by the molecule is the 
sum of those for scattering by the individual nuclei. 

Perturbation theory is not in general applicable to a collision between a 
neutron and a nucleus: although the range of nuclear forces is small, they are 
very strong within that range. It is important, however, that the amplitude 
for scattering of a slow neutron (whose wavelength is large compared with the 
dimensions of the nucleus) is a constant independent of the velocity. Let fa 
be the amplitude for scattering by the ath nucleus; | fa|? do is the differential 
cross-section for elastic scattering of the neutron by a free nucleus (in their 
centre-of-mass system). 

The constant amplitude can be formally obtained from perturbation theory 


+ It is also assumed that the molecule has no magnetic moment. Otherwise, there is a 
further specific scattering effect due to the interaction of the magnetic moments of the 
molecule and the neutron. 
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if we describe the interaction of the neutron with the nucleus by a “point” 
potential energy 


U(r) = - 2/62), (151.1) 


where M is the reduced mass of the neutron and the nucleus; when this 
expression is substituted in Born’s formula (126.4), the delta function makes 
the integral a constant independent of q. The “field” U(r) thus defined is 
called a pseudo-potential. It should be emphasized that the possibility of 
defining this is due to the fact that f is constant. In the general case of an 
arbitrary neutron energy, the scattering amplitude depends on the initial and 
final momenta p and p’ separately, and not only on their difference q, whereas 
the amplitude given by the Born approximation can depend only on q.t 

If the scattering nucleus executes a given motion (for example, vibrations 
in a molecule), and we average over this motion, then the interaction (151.1) 
is “smeared” over a region of dimensions in general large compared with the 
scattering amplitude f. For such a “smeared” interaction the condition 
(126.1) for the Born approximation to be valid is satisfied. 

Thus we can describe the neutron—molecule interaction by the pseudo- 
potential 


U(r) = —2nh? > = fail Ra), (151.2) 
-~rh 


where the summation is over all the nuclei in the molecule, Rg are their 
radius vectors, and r that of the neutron. Substituting this expression in 
formula (148.3), with M, in place of m, Wm being the reduced mass of the 
molecule and the neutron, we obtain the following formula for the cross- 


section for scattering of a neutron by a molecule, in the centre-of-mass 
system: 


: ; | 
don = Mat pS zy fasn|e**\0>/2do5 (15453) 
exe a i 


the matrix elements are taken with respect to the wave functions of the 
stationary states of motion of the nuclei with energies Ep and En, and the 
momenta p and p‘ are related by the law of conservation of energy: 


(p? =p AZ M = E — Eo. 


Formula (151.3) describes an inelastic collision with a definite change of 


t Although the pseudo-potential gives the correct value of the scattering amplitude when 
perturbation theory is formally applied, this does not mean that perturbation theory is actually 
applicable to this held, For a potential well of depth U, which tends to infinity in such a way 
that U a? = constant (where a is the radius of the well, tending to zero), the conditions 
(126.1), (126.2) are certainly nor satished 
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state of the motion of the nuclei in the molecule (transition 0 —> 7), and is the 
solution of the problem stated: from the amplitudes (assumed known) for 
scattering of neutrons by free nuclei, it determines the cross-section for 
scattering by the molecule, taking account of the intrinsic motion of the nuclei 
and interference effects due to scattering by different nuclei. 

If the nuclei have non-zero spin, the fact must also be taken into account 
that the scattering amplitudes fg depend on the total spin of the scattering 
nucleus and the neutron. This can be done as follows. 

The total spin of the nucleus and the neutron can take two values, 
ja = tat 4, where 7g is the spin of the nucleus; we denote the corresponding 
scattering amplitudes by fg and fz. We form a spin operator whose eigen- 
values for definite values of jg are f? and f3 respectively. This operator is 


fa = q+ ba ~ia, (151.4) 


where i, and § are the spin operators of the nucleus and the neutron, and the 
coefficients ag and bg are given by the formulae 


1 
a = ———] (tat 1ft tifa h, 
a i K +1 + tafe ] 
(151.5) 


= Rae o 


a 


This is easily seen if we note that, for a given value of 7, the eigenvalue of 
the operator §.i is 


s.i = 4O + 1) — t@ + 1) =i]: 


The operators (151.4) must replace fa in formula (151.3), and their matrix 
elements corresponding to the transition considered must be taken. If the 
incident neutrons and the target nuclei are unpolarized, then the scattering 
cross-section must be appropriately averaged. 


PROBLEMS 


PROBLEM 1. Average formula (151.3), assuming the directions of the spins of the neutrons 
and the nuclei to be distributed entirely at random, and all the nuclei in the molecule to be 
different. 


SoLUTION. ‘The averagings with respect to the directions of the spins of the neutrons and 


of the nuclei are independent, and each spin gives zero on averaging; hence 8.1 = 0. If 
the molecule contains no two like atoms, there is no exchange interaction of the nuclear 
spins, and, since their direct interaction is negligible, the directions of the spins of the various 
nuclei in the molecule may be regarded as independent; the products of the form 
(s.i1) (S. i2) therefore also give zero on averaging. For the square (s. i)? we have 


(s.i)? = 48212 = $5(s+1)i(¢ +1) = }a(?' + 1). 
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This gives the following expression for the averaged cross-section: 
do, = ME] Eta nle-H*F510|24 
pL Ma 
alia +1), 5 
a prilet D, EE] do. 
a i a“ 


PROBLEM 2. Apply formula (151.3) to the scattering of slow neutrons by parahydrogen 
and orthohydrogen (J. Schwinger and E. Teller 1937). 


SOLUTION. Before the matrix elements of the spin operators are taken, the expression 
(151.3) for the scattering by a hydrogen molecule is 


T TEP Janje- + eia112]0)] n 
P 


+b . <n|îze -i92 4 fgei/2)0X|2 do, (1) 


a=HYt+f-), b =fr 


+ r/2 being the radius vectors of the two nuclei in the molecule relative to their centre of mass. 

The rotational and vibrational states of the molecule are defined by the quantum numbers 
K, Mx, v (which are together-represented by 7 in (1)). In the electron ground state of the 
Hea molecule, even values of K are possible only for a total nuclear spin J = 0 (parahydrogen), 
and odd values of K only for J = 1 (orthohydrogen) (see §86). We must therefore distin- 
guish two cases: (1) transitions between rotational states with values of K of the same parity, 
which are possible only for unchanged J (ortho-ortho and para-para transitions), (2) tran- 
sitions between states with values of K of different parity, which are possible only when J 
changes (ortho-para and para-ortho transitions). In the first case we have 


(nje 120) = Lajat = [nicos $q . 1/0); 


it should be remembered that the rotational wave function is multiplied by (—1)* when the 
sign of r is changed. The spin operator in (1) then becomes 2a¢+68.1, where Î = îi + ig. 
This operator is diagonal with respect to J, in accordance with the above discussion. The 
square (2a +bs . I)? is averaged, as in Problem 1, giving 


4a? 4 1b27(1+1) 
The result is 
ae A Cacos 4q . JOXI = +f 2+ IL +4 Wf + +f -)?] do. (2) 


In the second case 
Cnleterr210> = —- <nle~***/2[0) = i<n|sin $q . r[0), 


and the spin operator in (1) becomes 5.(î1— î2); it has only matrix elements which are non- 
diagonal with respect to J. The squared moduli of these elements, summed over all possible 
values of the component of the total spin I’ in the final state, are calculated as the mean values 
(diagonal elements) of the square [s.(i, —i,)]* (see the first footnote to §140): 


(s.(i; — iz)]* = 1G —iz)2 
= }(2i,? + 2i —I?) 
= 3[(3 -Z(1+1)]. 
The result is 
don = (1)(3)z E\cnlsin $q- r/0>1°(f + -f -)2 do, (3) 


where the coefficient 1 appears for ortho- 
transitions. 

If the neutrons are so slow that their wavelength ıs large even compared with the size of 
the molecule, then we can put cos($q.r) = 1, sin(4q.r) = 0 in the matrix elements in (2) 


para transitions and the coefficient 3 for para-ortho 
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and (3), so that they are all zero except the diagonal element 00; in these conditions, of 
course, only elastic scattering is possible. The elastic scattering cross-section in this case ts 


4 
dce = Ph +f-)? +11 + 1)(f+—f-)2]do. 


PROBLEM 3. Determine the cross-section for the scattering of neutrons by a bound 
proton, regarded as an isotropic three-dimensional oscillator of frequency w (E. Fermi 1936). 


SoLuTIon. Considering the proton as oscillating about a point fixed in space, we must 
put in formula (151.3), from its derivation, Mm = M and Ma = 4M (M being the mass of the 
proton). Then 

p i 
don = — —Ljfe*9ooo(r)yin,n,n(t)dV |*do, 


p T 


where oo = 4r} f|? is the cross-section for scattering by a free proton, and Ynjngns are the 
eigenfunctions of the three-dimensional oscillator corresponding to the energy levels En = 
kiw(n+3/2); the summation is over all values of 71, nz and m3 whose sum has a given value n. 
The functions ninang are products of the wave functions of three linear oscillators (see §33, 
Problem 4). The required integral therefore falls into the product of three integrals of the 
form 


© 
| e-10:7/2e-0°2?/2g-at1?/2H p (ax)dx 
-@ 


(x = ~/(Mw/h)), which are found by substituting Hn,(x) in the form (a.4) and integrating 
nı times by parts. The result is 


1 Y o0 gz 2Mig2%39,2Na 
oe eS 
m V 2Mgen ny'no!na) 








e-0°/22?do. 


The summation is effected by the binomial theorem, and the final result is 


fos] E q? n 
den = Ja ) e-t /2a?do. 
mn! N EN\ 2a? 


In particular, the elastic scattering cross-section (n = 0, E = FE’) is 








co hw 
days —arotiiatdo “Ge ope tet); 


T 


as E/iw —> 0, oe > 400. 


§152. Inelastic scattering at high energies 

The eikonal approximation used in §131 for the problem of mutual 
scattering of two particles can be generalized to cover also processes (including 
inelastic ones) in the collision of a fast particle with a system of particles, or 
“target” (R. J. Glauber 1958). 

In this generalization, the principal assumptions made are as before. ‘The 
energy E of the incident particle is assumed so large that E > | U|andka > 1, 
where U is the energy of interaction between this particle and the target 
particles, and a the range of this interaction. We consider scattering with a 
relatively smal] momentum transfer: the change /iq in the momentum of the 
incident particle is small in comparison with its original value Ak (q < k). 
This condition, however, now implies not only that the angle of scattering 1S 
small but also that the energy transferred is relatively small. 
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We shall also suppose that the velocity v of the incident particle is large 
compared with the velocities vo of the particles within the target: 


v > vo. (152:1) 


For the scattering of charged particles by atoms, this condition is equivalent 
to the validity of the Born approximation (cf. §§148 and 150): if v > vo, it 
necessarily follows that |U|a/iv < 1. There is therefore no need for the 
present theory in that case. The situation is different, however, for nuclear 
targets, where the particles are held by nuclear and not Coulomb forces. In 
the following we shall discuss the particular case of the scattering of a fast 
particle by a nucleus.t 

The condition (152.1) enables us to consider the motion of the incident 
particle for fixed positions of the nucleons in the nucleus.{ That is, the wave 
function of the particle-target system may be written 


U(r, Rj, Ro, | = f(r; Ri, Ro, ...)®;(R), Ro, ces (132:2) 


Here ®;(Rı, Rg, ...) is the wave function of the ith internal state of the 
nucleus; R;, Ro, ... are the radius vectors of the nucleons in it. The factor 
(r; Ri, Re, ...) is the wave function of the particle undergoing scattering (r 
being its radius vector) for given values of Rj, Re, ..., which act as parameters 
in Schrédinger’s equation 


-a5 Uatt- Ra) |é = 54 (152.3) 


where Ua(r— Ra) is the energy of interaction of the particle with the ath 
nucleon, and Ak the momentum of the particle at infinity.|| 
If we find a solution of (152.3) with the asymptotic form 


¢ = e*t" 4+ F(n', n; Rj, Ro, ...jetkr/y, (152.4) 
where n’ = r/r, n = kjk, then the wave function (152.2) 
Y = D+ FO ett! (152.5) 


will describe scattering by a nucleus that is in its 7th state before the collision: 
the incident wave e" appears in (152.5) asa product with ®,;. The second 


t The condition (152.1) gives relativistic velocities v for 
here the formalism in non-relativistic theory, 
bility to any particular scattering problems. 

t This approximation is analogous to the one on which the theory of molecules is based 
in which the electron state is considered for fixed positions of the nuclei. i 

I} In (152.3) it 1s assumed that the interaction of the particle with the nucleus is equal to a 
sum of binary interactions with the individual nucleons. 


any heavy nuclei. In presenting 
We ignore the question of its practical applica- 
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term in (152.5) represents the scattered wave. This expression, however, 1s 
appropriate for determining the scattering amplitude only if the change in 
the energy of the incident particle is sufficiently small, i.e. if the change in the 
internal energy of the nucleus is sufficiently small; thus, by considering the 
motion of the particle in the constant field of “rigidly fixed” nucleons 
(corresponding to equation (152.3)), we neglect a possible change in the 
energy of this motion. 

To separate the scattering amplitude with a definite change of the internal 
state of the nucleus, we must put 4 in the form 


Y = &*O,+ 2 fr(n, n)® petk" ir, (152.6) 


where the summation is over various states of the nucleus; fy;(n’, n) then 
gives the required scattering amplitude with a particular transition 1 f of 
the nucleus, as a function of the scattering angle (the angle between n and 
n’). Comparison of (152.6) and (152.5) shows that 


fin’, n) = | ©FFO, dr, (152.7) 


where dr = d3R)d3Re ... is the volume element in the configuration space 
of the nucleus. We must again emphasize that this formula is valid only when 
the energies of the states 7 and f have a relatively slight difference. 

The solution (152.4) itself of equation (152.3) is found by the method 
described in §131.t Similarly to (131.7) we have 


F(n’, n; Rj, Ro, ...) = =| [S(p, Ri, Re, ...)— le"? d2p, (152.8) 
T 


where 


S(p, Ri, Re, ...) = exp[2Zz6(p, Ri, Ro, --)], 


8(p, Ry, Ro, ...) = È ôalp — Raj), 
j (152.9) 


o 


l 
ôalp — Rai) = E77 U,(r — Ra)dz. 


— © 


It will be recalled that p is the component of the radius vector r in the 
xy-plane (perpendicular to k), and Rg, is the corresponding component of 
the radius vector Ra; Aq = p’—p is the change in momentum of the 


+ It has been noted in §131 that the initial expression (131.4) for the wave function 1s valid 
only at distances z < ka?. This was not important in the further derivation in §131, but for 
scattering by a system of particles (such as a nucleus) it leads to another limitation: the 
expression (131.4) must be valid throughout the volume occupied by the scattering system, 
i.e. we must have R, < ka?, where Ro is the radius of the nucleus and a the range of the 
potentials U. 


§152 Inelastic scattering at high energies 647 


scattered particle, and only its transverse components appear in (152.8). The 
functions ĉa determine the amplitudes for elastic scattering of the particle 
by the individual free nucleons: 


faz La | {exp[2i8(p)] — 1} e~% d2p, (152.10) 


When îi = f, (152.7) and (152.8) give the amplitude for elastic scattering by 
the nucleus: 


faln’, n) = i | [S(p) — tJe-4 d2p, (152.11) 


where the bar denotes averaging with respect to the internal state of the 
nucleus: 


S(p) = f S(p, Ra, Ra, ...)/®AR1, Re, ...)|? dr. (152.12) 


This formula generalizes the previous formula (131.7). 
Putting in (152.11) n’ = n and using the optical theorem (142.10), we 
obtain the total scattering cross-section: 


o = 2 f (1—re Š) dp. (152.13) 
The integrated elastic scattering cross-section a, is obtained by integrating 


| fu|? over the directions of n’. For small scattering angles 6, we have q = kô 
and the solid-angle element do ~ d2g/k?. Hence: 


oe = | | ful? d?g/kè. 


Writing ffi, with fi from (152.11), as a double integral over d2pd2p’, we 
can carry out the integration with respect to d2g by means of the formula 


f e~a- d2q = (27)28(p— p’), 


and the delta function is then eliminated by integration over d?p’. The 
result is 


oe = | |S- 1]? d2p. (152.14) 
The total reaction cross-section is 


Or = or— 0e = { (1—|Š|2) d2p. (152.15) 
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Note that (152.13)-(152.15) are in agreement with the general formulae 
(142.3)-(142.5): on changing, in the latter, from summation (over large Z) to 
integration over d2p with p = J//k, and replacing S; by the function S(p), we 
obtain (152.13)-(152.15). 


PROBLEMS 
PROBLEM 1. Express the amplitude for elastic scattering of a fast particle by a deuteron 
in terms of those for the proton and the neutron (R. J. Glauber 1955). 


SOLUTION. According to (152.11), the amplitude for elastic scattering by a deuteron 1s 
k l 3 : , 
f'%Xq)= mal |a(R)|?{exp[215n(p — 3R,) + 216 (p+ }R,)]—1} x et" d3R d2p (1) 
771 


Here ¢a(R) is the wave function of the relative motion of the neutron (n) and the proton (p) 
in the deuteron; R = Ra —Rp, and R, is the component of R in the plane perpendicular to 
the wave vector k of the ncident particle. The difference in the braces in (1) may be written 


exp(2iðn + 215 p)— 1 = (e2n— 1) + (ein — 1) + (e2iðn — 1)(e2Hy — 1). 


The integrals can then be trarsformed, using the definition of the scattering amplitudes for 
the neutron f ‘ and the proton f (P, according to (152.10) and the converse formulae 


27i 


exp[2iba(p)]—1 = E | Fonae*a%ol(2ny 
The result is 


f'La) = HAF) +f PaF- q) 


- | reatata a-a) dy’, (2) 


where 


F(q) = Í |Pa(R)|2 e-t9F/2 d3R 


is the deuteron form factor. 
Putting in (2) q = 0 (with F(0) = 1) and using the optical theorem (142.10), we find the 
total deuteron scattering cross-section: 


or D = off”) + oP) +5 re | F(2q) f (q) f P — q) d2o. (3) 


PROBLEM 2. Determine the cross-section for disintegration of a fast deuteron into a 
separate neutron and proton in scattering by a heavy absorbing nucleus whose radius Rọ is 
large compared with the deuteron wavelength (kR, > 1, where Ak is the deuteron momentum) 
and the deuteron radius (E. L. Feinberg 1954; R. J. Glauber 1955; A. I. Akhiezer and 
A. G. Sitenko 1955). 


SoLuTION. With regard to the incident deuteron plane wave, the large (kR > 1) ab- 
sorbing nucleus acts as an opaque screen at which the wave is diffracted. The wave function 
of the incident deuterons is e*-*yYe(R), where ¥a(R) is the internal wave function of the 
deuteron (R = R,—R, is the radius vector between the neutron and the proton in it, r = 
(Rn + R3) that of their centre of mass). The presence of the absorbing nucleus ‘‘removes”’ a 
part of this function corresponding to transverse coordinates of the neutron and the proton 
(pn and pp) that lie in the ‘‘shadow”’ of the nucleus, i.e. within a circle of radius Ra. Thus the 
wave function becomes 


y= eS, pn, pr) ¥a(R), 
where S = 1 for pn, pp > Re and S = Q if either pn or pp is less than Ro.t This function, 


+ The Coulomb interaction of the deuteron with the nucleus is neglected. 
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without the factor pa, corresponds to the expression for the incident wave in the form (131.5) 
(it neglects the curvature of the rays by diffraction), and the factor S therefore has the same 
significance as in §§131 and 152. 

Analogously to (152.13) and (152.14), the total deuteron scattering cross-section. Tt 
(including all inelastic processes) and the elastic scattering cross-section o¢ are 


of = 2 f (1-§) d2p, of = f (S-1» d2p, 


where p = }(pn+ pp) and we have used the fact that S is real; the averaging of S is with 
respect to the ground state of the deuteron, 


Slp) = f Sta? aR. 


For ya, it is sufficient to use the function 


which is valid for distances R beyond the range of the nuclear forces between the neutron and 
the proton (cf. (133.14); x = +/(m|¢«|)/A, where |e] is the deuteron binding energy and m the 
nucleon mass). From the definition of S, 1—S is not zero if one or both of the nucleons come 
within the circle of radius Ry and are absorbed by the nucleus: hence 


Tcap 7 fa-5) d2p = łot (1) 


is the cross-section for the capture of one or both nucleons. On the other hand, o: = Scape + 
Get Odis, where odis is the required cross-section for “diffractive” disintegration of the 
deuteron. Hence 


vas = bor—oe = | S(1~S) dp. (2) 


When Rox > 1, the important distances from the edge ofthe nucleus are those which are 
small and ~ 1/«; then the integration along the edge gives a factor 27R,, and the integration 
in the perpendicular direction can be carried out as if the shadow region were bounded by a 


straight line. Taking the latter as the y-axis, and the x-axis outwards from the shadow, we 
have 


dais = 2nRo | S(x)[1 ~ S(x)] dx, 


with the integral 


f f ¥e(R) aXd¥dZ, R= y(X2+¥24+ Z9, 


—9F 


S(x) = f 


taken over the region Xn, Xp > O for a given x = $(.X, + Xp) or, what is the same thing, 
over the region |X| = |Xn—X,| < 2x. The integral is transformed by changing to the 
variables X and R and to the polar angle in the YZ-plane (with dY dZ ->2nR dR), becoming 


S(x) = ]-—e-424 any | Z dé, (3) 


The integral (2) with this function S(x) is calculated by repeated integration by parts, using 
the formula 


fer- = log 2. 
0 
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The result ist 


T 


Ro (log 2 — 2). 
3x 


Odis = 


+ With the same condition «R, > 1, the capture cross-section is 


Soap = amRo*+7Ro!4n; 
the integral (1) over the region p> Ro is calculated by means of (3), and that over the region 
p< Ro gives 7R,*. This cross-section includes both capture of the whole deuteron and 
capture of one nucleon with release of the other (stripping reaction). The cross-section for 
the latter reaction is calculated as the impact area (averaged with respect to %¢*) corresponding 
to the incidence of only one nucleon on the shadow region, and is 


Scapt, n © Orap, p = 7Ro/4« 


(R. Serber 1947). 


MATHEMATICAL APPENDICES 


§a. Hermite polynomials 


The equation 
y” —2xy'+2ny = 0 (a.1) 


belongs to a class which can be solved by what is called Laplace's method.+ 
This method is applicable to any linear equation of the form 


n dmy 
2 anal % 


m=O 


whose coefficients are of degree in x not higher than the first, and consists 
in the following procedure. We form the polynomials 


P(t) = E amt, Olt) = È btm, 
m=O m=O 


and from them the function 
Z(t) = (Q)ie, 


which is determined to within a constant factor. Then the solution of the 
equation under consideration can be expressed as a complex integral: 


y= Í Z(t)er* dt, 
C 


where the path of integration C is taken so that the integral is finite and non- 
zero, and the function 
V =ettOZ 


returns to its original value when z describes the contour C (which may be 
either closed or open). 
In the case of equation (a.1) we have 


P= t242n, = —2t, Z= —— t V = tina 
2yntl i” 


so that its solution is 
y= f e2t-t3/4 dtjin+l, (a.2) 
t+ See, for instance, E. Goursat, Cours d’ Analyse Mathématique, Vol. II, Gauthier-Villars, 


Paris; V. I. Smirnov, Course of Higher Mathematics, Vol. III, Part 2. Pergamon, Oxford, 1964. 
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For physical applications we need only consider values n > —4. For 
these values the contour of integration can be taken as Cı or Ce (Fig. 52); 


these satisfy the required conditionst, since the function V vanishes at their 
ends (t = +œ or t = — œ). 


Let us find the values of the parameter n for which equation (a.1) has 
solutions finite for all finite x, which tend to infinity, as x > + œ, not more 
rapidly than every finite power of x. First, we consider non- integral values 
of n. The integrals (a.2) along C, and C, then give two independent solutions 
of equation (a.1). We transform the integral along C, by introducing the 
variable u such that t = 2(x—u). Omitting a constant factor, we find 


y= ex. Í e-t du/(u—x)"+1, (a.3) 
C” 


where the integration ‘3 taken over the contour C’ in the complex plane of 
u, as shown in Fig. 53. 


As x > +œ, the whole path of integration C; moves to infinity, and 
the integral in (a.3) tends to zero as e77. As x -> —oo, however, the path 
of integration extends along the whole of the real axis, and the integral in 
(a.3) does not tend exponentially to zero, so that the function y(x) becomes 
infinite essentially as e7'. Similarly, it is easy to see that the integral (a.2) 
along the contour C2 diverges exponentially as x + +00. 


+ These paths will not serve for negative integral n, since the integral (a.2) along them then 
vanishes identically. 
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For positive integral n (including zero), on the other hand, the integrals 
along the straight parts of the path of integration cancel, and the two integrals 
(a.3), along C,’ and C,’, reduce to an integral along a closed path round the 
point u = x. Thus we have the solution 


Ja = ex" d e-u" du/(u—x)7+1, 


which satisfies the conditions stated. According to Cauchy’s well-known 
formula for the derivatives of an analytic function, 


nl e f(t) 
f™x) = = $ an 


mi J (t—x)”+l 


y(x) is, apart from a constant factor, an Hermite polynomial: 
, dn 
Hy(x) = (—1)%e7 i ; (a.4) 


The polynomial H,, expanded in decreasing powers of x, has the open 
form 


n(n—1 n(n—1)(n—2)(n—3 
H,(x) = ea 4— 0... (a5) 


~~ 


It contains only powers of x which are of the same parity as n. We may write 
out here the first few Hermite polynomials: 


Ho =1, Ay = 2x, He = 4x?—2, H; = 8x3—12x, H, = 16x4—48x2+ 12. 
(a.6) 


To calculate the normalization integral, we replace e-7' H, by its expres- 
sion in (a.4) and integrate times by parts: 


(e0) (e0) 
: cl a 
fe Hn (x) dx = | (DPE dx 
dx” 
-0 Gasa) 


[2e] 
ely 
= Í e-z —H, dx. 
dx” 
-0 
But d”H,„/dx"” is a constant, 2”n!. Thus 


| e-® H,2(x) dx = 22nn! r. (a.7) 
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$b. The Airy function 
The equation 


a= 2 (b.1) 


is of Laplace’s type (see §a). Following the general method, we form the 
functions 


P=, Q=-1, Z= —e th, V = ext-8/3, 


so that the solution can be represented in the form 


y(x) = constant x Í e7t-t"/3 dt. (b.2) 
C 


The path of integration C must be chosen so that the function V vanishes 
at both ends of it. These ends must therefore go to infinity in the regions 
of the complex plane of t in which re £3 > 0 (the shaded regions in Fig. 54). 


lqe~ 1 


il 
C 
Fic. 54 


A solution finite for all x is obtained by taking the path C as shown in the 
figure. It can be displaced in any manner provided that the ends of it go to 
infinity in the same two shaded sectors (I and III in Fig. 54). We notice that, 
by taking a path which lay in sectors III and II (say), we should obtain a 
solution which becomes infinite as x > ©. 


Deforming the path C so that it goes along the imaginary axis, we obtain 
the function (b.2) in the form (substituting t = iu) 


D(x) = =. J cos(ux+ $u?) du. (b.3) 
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The constant in (b.2) has been put equal to —7z/24/7, and we have denoted 
the function thus obtained by (x); it is called the Airy function.+ 

The asymptotic expression for (x) for large values of x is obtained by 
calculating the integral (b.2) by the saddle-point method. For x > O, the 
exponent in the integrand has an extremum for t = + +/x, and the “‘direction 
of steepest descent” of the integrand is parallel to the imaginary axis. 
Accordingly, to obtain the asymptotic expression for large positive x, we 
expand the exponent in powers of t+4/x and integrate along the line C, 
(Fig. 54), which is parallel to the imaginary axis; the distance O4 = yx. 
Making the substitution t = —4+/x+2u, we have 


D(x) x Va 


e—(2/3)zx91 -uty T du, 


aa 


whence 
D(x) x px-Lte-2/Dzt, (b.4) 


Thus, for large positive x, the function (x) diminishes exponentially. 
To obtain the asymptotic expression for large negative values of x, we 
notice that, for x < 0, the exponent has an extremum for t = 14/|x| and 
= —14/|x|, and the direction of steepest descent at these points is along 
lines at angles —}m and r respectively to the real axis. Taking as the path 
of integration the broken line C, (the distance OB = y |x|), we have, after 
some simple transformations, 


O(x) = |x{1/4sin(¥[x[943n). (b.5) 


Thus, in the region of large negative x, the function (x) is oscillatory. 
We may mention that the first (and highest) maximum of the function (x) 
is D(— 1-02) = 0-95. 

The Airy function can be expressed in terms of Bessel functions of order 4. 
The equation (b.1), as can easily be seen, has the solution 


Vx Z13(8x/*), 


where Z,/,(x) is any solution of Bessel’s equation of order 4. The solution 
which is the same as (b.3) is 


O(x) = 4y (7x) [7 -1/3( §x9/2) — T1)9( $x3/2)] 
= y (x/3n)K1;3(4x3/2) for x > 0, (b.6) 
D(x) = 4v (alx -13l + Jia(3]>|3/2)] for x < 0, 


where 


I(x) = iv] (ix), K(x) = -——[I _,(x)-1(x)]. 


2 sin vr 





, t We follow the definition proposed by V. A. Fok; see G. D. Yakovleva, Tablitsy funktsiz 
Eitri (Tables of Airy Functions), Nauka, Moscow, 1969. The function P(x) is one of two 
defined by Fok, who denotes it by V(x). In the literature, another definition of the Airy 
function is also found, which differs from (b.3) by a constant factor: Ai(x) = ®(x)/ y r. 
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Using the recurrence relations 


RG) 2K E = ZK (x), 
2K,"(2) E 


we easily find for the derivative of the Airy function 


@'(x) = ———Ko,9($x3/2) for x >0. b.7 
(x) (Gn) 2/3(§x°/*) for x > (b.7) 
When x = 0, 
OG) ees 0070, 
SETCE) 
1/672 
oo) = -2 1) _ _ 9.459. 
24/r 
Figure 55 shows a graph of the Airy function. 
I-O 
os 
0-4 
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‘Or 
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§c. Legendre polynomialst 
The Legendre polynomials P;(cos 6) are defined by the formula 


J di 


e epee 1)". (c.1) 
2!I! (d cos 6)! 


Pi(cos 6) = 





+ There are in the mathematical literature many good accounts of the theory of spherical 
harmonics. Here we shall give, for reference, only a few basic relations, and make no attempt 
at a systematic discussion of the theory of these functions. 


§c Legendre polynomials 657 
They satisfy the differential equation 





B ee 
d Kl+1)P; = 0. 
aa 1 Ce (FE) 


The associated Legendre polynomials are defined by 


Pasay asim 9 AN) 


(d cos 0%” 
= in™ 6 Bie os? 0—17, 
I sin TETT (c.3) 
or, equivalently, 
ee (+m)! . | di—m i 
Pr'(cos 8) = ( DT IZT sin-™ ĝ (d cos oyin (cos? 6 — 1), (c.4) 


with m = 0, 1, ..., Z. The associated polynomials satisfy the equation 





j ee eer ap m? 
Be PE +f —-—— |P" = 0. 9 
sin ô dô (sin a GUia A >) 


The normalization integral 


Í [Pi(p)]? dp 


=1 


(p = cos 0) for the Legendre polynomials i is calculated by substituting (c.1) 
and integrating / times by parts, which gives 


E an! f 
oaks | (oP 1S? de = EIS | O- de 
“1 





Substitution of u = }(1— p) reduces this integral to Euler’s beta function, 
and the result is 


(c.6) 


| (Paw du = 2 


Similarly, it is easily seen that the functions Pi(u) with different / are 
orthogonal: 


{PoP du =0, lel. (c.7) 
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The calculation of the normalization integral for the associated Legendre 
polynomials is easily effected by a similar method. We write [P7"(z)]* as a 
product of the expressions (c.3) and (c.4), and integrate /— m times by parts; 
the result is 


1 


| [PPH]? dp 


“1 


__2_ (lem) 
~ 2141 (l-m)t ce 


It is also easily seen that the functions P;” with different /(and the same m) are 
orthogonal: 


1 


f Pr@yPr) de = 0, lel’, (c.9) 
-1 


The calculation of the integrals of products of three Legendre polynomials ts 
discussed in §107. 

The following addition theorem holds for Legendre polynomials. Let y be 
the angle between two directions defined by the spherical angles 6, ¢ and 
6’, $’: cos y = cos l cos 6’ +sin @ sin 6 cos (¢—¢’). Then 


P;(cos y) = P;(cos @)P;({cos 6’) 





a > 2 - m Pr(cos 6)Pr(cos 6’) cos m($—¢’). (c.10) 


This theorem can also be written in terms of the spherical harmonic functions 


defined by (28.7): 


An 


ET Dy VID’) Y (0). (c.11) 


m= -i 


P(n . n’) 


Here n and n’ are two unit vectors, and Yim(n) denotes the spherical har- 
monic function of the polar angle and azimuth of the direction of n relative to 
a fixed system of coordinates. 

If equation (c. 10) is multiplied by Pi(cos 6) and integrated over do = 
sin 6 dô dd, the integration with respect to ¢ gives zero for all'terms on 
the right that contain factors cos m(¢ — ¢’); using (c.6) and (c.7), we obtain 


4r 
Pilcos 0’). 
TERUNA 





| Pj(cos y)P1-(cos @)do = 811- 


§d The confluent hypergeometric function 659 
This result may be written in the symmetrical form 


An 
| Pa . n2)Pi(n; . M3)do, = bu Pine -N3),  (c.12) 


where nj, ne, ng are three unit vectors and the integration is with respect to 
the direction of nı. 
Finally, we give the first few normalized spherical harmonics Yj: 


Yoo = 1//(47); 

Yio = iy/(3/4r)cos 8, Yy41 = F14/(3/8r)sinð . etió; 
= 4/(5/167)(1 —3 cos26), 

Your = +4/(15/87) cos @ sin®@ . eté, 

Yo.49 = —+/(15/327) sin?0 . et2#¢; 


a 
i=) 
| 


Y30 = —14/(7/16r) cos 8(5 cos*6— 3), 
Y3,41 = +14/(21/647) sin A(5 cos?8— 1)e+#9, 
Y3, 42 = —24/(105/3277) cos @ sin?0 . e+2¢, 
Y3,43 = +24/(35/647) sin’ . et, 


$d. The confluent hypergeometric function 
The confluent hypergeometric function 1s defined by the series 


aa +1 g A 
F(a,y,2) = 1+- — ae E 


yl yo+) M 





ery 


which converges for all finite z; the parameter « is arbitrary, while the 
parameter y is supposed not zero or a negative integer. If « is a negative 
integer (or zero), F(a, y, z) reduces to a polynomial of degree |a]. 

The function F(a, y, z) satisfies the differential equation 


zu’ +(y— zju —au = 0, (d.2) 


as is easily seen by direct verification.t By the substitution u = 2’, 
this equation is transformed into another of the same form, 


zu” +(2—y—2z)m'—(a—y +1) = 0. (d.3) 
Hence we see that, for non-integral y, equation (d.2) has also the particular 
integral z1 F(a—y+1, 2—y, z), which is linearly independent of (d.1), 
so that the general solution of equation (d.2) is of the form 


u = F(a, Ys 2)-+ coz! “YF (a—yt 1, 2—y, 2). (d.4) 


t The equation (d.2) with a negative integral y does not require special discussion, since it 
can be reduced to a case of positive integral y by the transformation which gives equation (d.3). 
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The second term, unlike the first, has a singular point at z = 0. 
Equation (d.2) is of Laplace’s type, and its solutions can be represented 
as contour integrals. Following the general method, we form the functions 


P(t) = yt—a, Q(t) =t(t—1), Z(t) = te-1(t—1)7-2-1, 
so that 


“= Í etzta-(t—1)7-a-1 di. (d.5) 


The path of integration must be chosen so that the function V(t) 
= e't*(t—1)”~ returns to its original value on traversing the path. Applying 
the same method to equation (d.3), we can obtain for u a contour integral of 
another form: 


u = zy Í etzta-y(t—1)— dż. 
The substitution tz -> t reduces this integral to the convenient form 


u(z) = Í et(1—z)-a10-7 dt, (d.6) 
and the corresponding function V to 


V(t) = ete-ytl (1z) 7". 


The integrand in (d.6) has in general two singular points, at t = z and 
t= 0. We take a contour of integration C which passes from infinity 
(ret > —oo)round the two singular points in the positive direction and back 
to infinity (Fig. 56). This contour satisfies the required conditions, since 
V(t) vanishes at its ends. The integral (d.6), taken along the contour C, 
has no singular point for z = 0; hence it must be the same, apart from a 





Fic. 56 


constant factor, as the function F(a, y, z), which also has no singularity. 
For z = 0 the two singular points of the integrand coincide; according to a 
well-known formula in the theory of the gamma function, 


= Í PET (4.7) 


mi 


§d The confluent hypergeometric function 661 
Since F(a, y, 0) = 1, it is evident that 


r 
F(a, Yo z) = v Í et(t—z)-4ta-Y dt. (d.8) 
C 


The integrand in (d.5) has singular points at t = Oand¢ = 1. If re(y —a) 
> 0, and y is not a positive integer, the path of integration can be taken as 4 
contour C” starting from the point £ = 1, passing round the point ¢ = 0 
in the positive direction, and returning to t = 1 (Fig. 57); for re(y—a) > 0, 


Ci ae 


Fic. 57 


the function V(t) returns to its original value of zero on passing round such a 
contour.t The integral thus defined again has no singularity for z = 0, 
and is related to F(a, y, z) by 


1 Pda) 


F(a,y,2) = = Tyna) 


$ etz( —t)=-1(1— t)r -2-1 dt. (d.9) 


The following remark should be made concerning the integrals (d.8), (d.9). 
For non-integral « and y, the integrands are not one-valued functions. 
Their values at each point are supposed chosen in accordance with the con- 
dition that the complex quantity which is raised to a power is taken with the 
argument whose absolute value is least. 

We may notice the useful relation 


F(a, Yo z) F enga, =z), (d.10) 


which is obtained at once by substituting t > t+2z in the integral (d.8). 

We have already remarked that, if « = —n, where n is a positive integer, 
the function F(a, y, z) reduces to a polynomial. A concise formula can be 
obtained for these polynomials. Making in the integral (d.9) the substitution 
t> |—#/z and applying Cauchy’s formula to the resulting integral, we find 


FA ) : : a +n-l d.11 
—n, Y: Z = Á zypz = -Zy y+- À f 
y(y+ 1)... (y+n—1) TG = ) (d.11) 


If also y = a positive integer m, we have the formula 


(—1)™-1 dmtn-1 


F(—n, m, z) = e 
m(m+1)...(m+n—1) dgm+n-ı 


(e-z2n). (d.12) 


t If y is a positive integer, C’ can be any contour which passes round both the points t = 0 
and t = 1. 
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This formula is obtained by applying Cauchy’s formula to the integral derived 
from (d.8) by the substitution ¢ > z—t. 


The polynomials F(—n, m, z), 0 <m <n, are (apart from a constant 
factor) the generalized Laguerre polynomials 


(nl)? 
L,™(z) = (—1)"——___F((— [n— m]}, m+ 1, 2) 


m\(n—m)} 


n! dn 
= — e? — (ezzr m) 
(n—m)! dz” 


n! dnem 
ezm ——(e-zz”). (d.13) 


(1—- 2m)! dan-m 





= (—1)" 


The polynomials L,,” for m = 0 are denoted by L,(z) and are called simply 
Laguerre polynomials; from (d.13) we have 


d” 
Ly(z) = e —(e-72). 
dz” 


The integral representation (d.8) is convenient for obtaining the asymptotic 
expansion of the confluent hypergeometric function for large z. We deform 
the contour into two contours C; and C2 (Fig. 56), which pass round the points 

= 0 and ż = z respectively; the lower branch of C, and the upper branch of 
C, are supposed to join at infinity. To obtain an expansion in inverse powers 
of z, we take (--z)~“ outside the parenthesis in the integrand. In the integral 
along the contour C, we make the substitution t + t+z; the contour Ce 
is thereby transformed into C,. We thus represent the formula (d.8) as 








ly) 
F Zz) = —27)-FOG(a.a-— ome 
(x,y, 2) Poa) )-#G(a,a—y+1, —2z)-+ 
gee E a: 1—«a, 2), (d.14) 
D(a) 
where 
G(«,f, z) = — Í (1+-) -et dt. (d.15) 
mi z 
C, 


In raising —z and z to powers in the formula (d.14) we must take the argu- 
ments which have the smallest absolute value. Finally, expanding (1-+#/ zy“ 
in the integrand in powers of t/z and applying formula (d.7), we have for 
G(«, B, z) the asymptotic series 


aß  a(a+1)8(B+1) 
Gepasit +... an (d.16) 


Formulae (d.14) and (d.16) give the asymptotic expansion of the function 
F(a, y, 2). 
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For positive integral y, the second term in the general solution (d.4) 
of equation (d.2) is either the same as the first term (if y = 1) or meaningless 
(if y > 1). In this case we can take, as a set of two linearly independent 
solutions, the two terms in formula (d.14), i.e. the integrals (d.8) taken along 
the contours C, and C, (these contours, like C, satisfy the required conditions, 
so that the integrals along them are solutions of equation (d.2)). The asymp- 
totic form of these solutions is given by the formulae already obtained; it 
remains for us to find their expansion in ascending powers of z. To do this, 
we start from equation (d.14) and the analogous equation for the function 
z! F(a—y+l1, 2—y, z). From these two equations we express 
G(«,«—y+1, —z) in terms of F(a, y, z) and F(a—y+1, 2—y, z); we then 
put y = p+e (p being a positive integer), and pass to the limit e —> 0, 
resolving the indeterminacy by L’Hospital’s rule. A fairly lengthy calcula- 
tion gives the following expansion: 


sin ra. 0(p—«) 
G(a, a—p-+- 1, —z) = eye 


Dp) (a+s)[f(a+s s stl 
xfloge. F(a, p,2)+ > Reet nike set —Wrr OL 


= PEHA) 
LOEO) 
+t Tere) “} ae 


s=1 


where ù denotes the logarithmic derivative of the gamma function: (a) 


= P(a)/P(«). 


§e. The hypergeometric function 
The hypergeometric function is defined in the circle |z| < 1 by the series 


= 2 
F(a, B,y,2) = ieee 2 genet 1) 


VET y+) j +..., (e.1) 


and for |z| > 1 it is obtained by analytical continuation of this series (see 
(e.6)). The hypergeometric function is a particular integral of the differential 
equation 


2(1—2z)u’ + [y—(a+8+)z]u'—ofu = 0. (e.2) 


The parameters « and § are arbitrary, while y # 0, —1, —2,.... The 
function F(«, B, y, z) is evidently symmetrical with respect to the parameters 
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« and B.t The second independent solution of equation (e.2) is 


21-1F(P—y-+1,0—y+41,2—y, 2); 


it has a singular point at z = 0. 

We shall give here for reference a number of relations obeyed by the hyper- 
geometric function. 

The function F(a, B, y, z) can be represented for all z, if re(y—a) > 0, 
as an integral: 


1 T—a)PG) 


F(a,8,y,2) = — (—t)2-1(1—2)7-2-1(1 —#2)-# dt, 
4 


a ROH 
(e.3) 
taken along the contour C” shown in Fig. 57. That this integral in fact 
satisfies equation (e.2) is easily seen by direct substitution; the constant 
factor is chosen so as to give unity for z = 0. 

The substitution u = (1—2)’-*~4u, in equation (e.2) leads to an equation 
of the same form, with parameters y—a, y—f, y in place of «, B, y respec- 
tively. Hence we have 


F(a, B,y, z) gg (l—z)7 Ryn; V1); (e.4) 


both sides of this equation satisfy the same equation, and they have the same 
value for z = 0. 

The substitution 1 -> #/(1—z+2t) in the integral (e.3) leads to the fol- 
lowing relation between hypergeometric functions with variables z and 
2](z—1): 


F(a,B,y,2) = (1—-z)-*F (a, y—B, y, 2/(z—1)). (e.5) 


The value of the many-valued expression (1—z)~* in this formula (and of 
similar expressions in all the following formulae) is determined by the con- 
dition that the complex quantity which is raised to a power is taken with the 
argument whose absolute value is least. 

Next we shall give, without proof, an important formula relating hyper- 
geometric functions with variables z and 1/z: 


_ TON (B-«), 
F(a,B,y,z) = TAG)” F(a,a+1—y,a+1—8, 1/2)+ 
Py) P(«—B) 
aF, B+ 1—y, B-+1—«, 1/2). (e.6) 
Tayo). 2)-*F(B,B-+1—y,B z) 


+ The confluent hypergeometric function is obtained from F(a, B, y, z) by taking the limit 
F(a, y, 2) =e F(«,B, y, 2/8). 


The notation F(a, P, y, z) for the hypergeometric function and iF (a, y, z) for the con Rua 
hypergeometric function is also used in the literature. The subscripts to the left and right o 
show the numbers of parameters in the numerators and denominators respectively of the 
terms in the series. 
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This formula expresses F(a, f, y, z) as a series which converges for z| > 1, 
i.e. it is the analytical continuation of the original series (e.1). 
The formula 


My) y—«—P) 
F(a, B, Yı z) =i mona or «+8+ ys 1—z)+ 
P(y)P(«a+B—y) 


— 27)\¥-a- = = 1—a«— Ea 
Pere) (1—z)7-*-#F(y—a,y—B, y+ B, 1—z) 


(e.7) 


relates hypergeometric functions of z and | — z; again, the proof will not 
be given here. Combining (e.7) with (e.6), we obtain the relations 


P(y)P(B—«) = E k A 
F(a, B, y, 2) = Te@rooa F(a, y—B,a+1—, 1/(1—z))+ 
(y)P(«a—B) 
—— (1—2) PFB, y—«, B+1—2«, 1/(1—2)), 8 
hero z. z)-PF(B, y—a, B /(i-—-2)) (e.8) 
ry) y—a—8) a aol 
F(a, B,y,2) = Toa o F(s, afi—y,a+tp+1 re )+ 


P(y)l'(a+8—y) "R -a-b yf-YF{ 1 — os == ae 9 
Tore) -(1 zy gP-Y ( Byy Byyt+ B, i ). (e.9) 


Each of the terms in the sums on the right of equations (e.6)-(e.9) is itself 
a solution of the hypergeometric equation. 

If « (or f) is a negative integer or zero, « = —n, the hypergeometric 
function reduces to a polynomial of the mth degree, and can be represented 
in the form 


Filan Biggs) pics a a rer ey (e.10) 
amo eae i i 


These polynomials are the same, apart from a constant factor, as the Jacobi 
polynomials, defined by 





Ppd (2) = r ED y, a+b+n+l,a+1, }—42) 
— 1y” n 
= A (1 —2)-4(1 + 2)-2 Z ((1 — z)e+n(1 + z)b +n]. (e.11) 


For a = b = 0, the Jacobi polynomials are the Legendre polynomials. For 
n = 0, Po% = 1. 
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$f. The calculation of integrals containing confluent hypergeometric 
functions 

Let us consider an integral of the form 
as | 
ia? = | mere, kz) dz. (f.1) 
0 
We assume that it converges. If this is so we must have re v > —1 and 
re A > |rek|; if « is a negative integer, the latter condition can be replaced 


by re A > 0. Using for F(a, y, kz) the integral representation (d.9) and 
effecting the integration over z under the contour integral, we have 


1 TF0) 
2mi PF(y—a) 
1 F(1—&)F(y) 


= —— — Arv 1)x 
2m F(y—a) pa 


[r a) 
Haye = — $| eÀ -ktzzr(— t)e-1(1—t)r-2-1 didz 
c’ 0 


x $ (—t)2-1(1—t)r-2-1(1—kt/à) -1 dt. 
c 
Using (e.3), we have finally 
Jay? = Vv IAAF (0,01, y, kA). £2) 


In the cases where the function F(a, v+1, y, k/A) reduces to a polynomial, 
we have for the integral J,” an expression in terms of elementary functions: 


d” 
Jap na = (= PTU eB) ah (f.3) 


ya ot DA— —k)ytn-v-1 dn 
ny =(— eran eR SEIT al A—k)*-741], f. 


Jam” = DNES ee ee 1)! oO yetQ— Beep 
i km-1(1—a)(2— a)... (m—1—a) dan 
m-n-2 


d 
+ni(m—n— 1)... (m—1)Aa-n-HA—k)-1Hm-n-a pa-e-1(a—A)e =]; (f.5) 





here m, n are integers, with 0 < n < m—2. 
Next, let us calculate the integral 


hid f e-keg>-I[F(—n, y, kz) F dz, (£.6) 
0 
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where 7 is an integer and re v > 0. To calculate this, we begin with a a 
general integral having e7% instead of e-*# in the integrand. We write aide 
of the functions F(—7, y, kz) as a contour integral (d.9), and then integrate 
over z, using formula (f.3): 


f 1 P+na)r(y) 
J ezzr -A F(—n, y, kz) ]® dz E e. x 


x f$ (—t) -1 — t)r -te A -khiz2¥-1 F(— 7, y, kz) dtdz 
0c 
2 
L l e pE CEAO 
2m P2(y-+7) 
an 
x $ (A—At—k)v+"-»(—1t)-9-1(1 ea —¥(A—ki—k)-7] dt. 
Cc’ 

The nth derivative with respect to A can evidently be replaced by a derivative 
of the same order with respect to t; we then put A = k,’and thereby return 
to the integral J,: 


1 P+O) 


d7” 
fim <a oe $ (—2) rena a a 
CY’ 


By integrating n times by parts, we transfer the operator d"/dé* to the expres- 
sion (—f)’-*-! (1—1£)"t™-1, and then expand the derivative by Leibniz’ 
formula. As a result, we obtain a sum of integrals, each of which reduces 
to Euler’s well-known integral. We finally have the following expression 


for the integral required: 
P(v)n! 
Jo = —— x 
R’y(y+1)...(y+n—1) 


n-1 


a o oes 
x 21 > ee £7 
tt (s+ 1)! yy +1)... +s) on 


It is easy to see that the integrals J, are related by 


s=0 


gia a 


Ire ana Jyp, (£8) 


where is any integer. 
We similarly calculate the integral 


oD 


J= f ez7-1 F(a, y,kz)F(o’, y, kz) dz. (f.9) 
0 
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We represent the function F(a’, y, k’z) as a contour integral (d.9), and 
integrate over 2, using formula (f.3) with n = 0: 


1 F(1—«)F(y) : 
J = pe Eeay $| (—t)* -1(1— t)r- -l gy-le-zlà -k OF (a, Y, kz) dzdt 
qT — KX 
á Cc’ 0 
1 To’? 
ie Sa $ (=) A117 1K) K EB dt 
1 2 
T yoga M 


By the substitution £ -> At/(A’t+A—A’), this integral is brought to the form 
(e.3), giving 


kk’ 
= T(t Akk- F a, «’, y, ——__—_). £10 
J = TEAR (se ne). E 
If « (or «’) is a negative integer, « = —n, this expression can be rewritten, 
using (e.7), as 
Pe(y)P —a’ 
= TOE Anta’ -y(A— Rk) "(A — RNT X 
Pyta) ye) 
F( 1 —nte'+1 aati (f.11) 
Xf —n,a,—n <i ee | 
a ee 
Fin-liy, let us consider integrals of the form 
c 
J?e, 0) = Í e-k+kzl?zy-1+5F(a, y, kz)F(a’,y—p,k’2z) dz. (£.12) 
0 


The values of the parameters are supposed such that the integral converges 
absolutely; s and p are positive integers. The simplest of these integrals, 


TEE a’), is, by (f.10), 


spel , Sommers Sack: A Akk N. 
J a) = WT (y)(R-E Rt -rk — k) (kk) -F (aa Ys -——5) ; (f.13) 


if « (or a’) is a negative integer, « = —n, we can also write, by (f-11), 


7 pen Gee + De Gaates Dy 
j y(y+1)...(y-+n—1) 


kk’ N2 
(= (R+) mek WAFL —n, a’, oft 1n—y,( -) r; (f.14) 


Jn, a) 





k—k 


§f Calculation of integrals containing confluent hypergeometric functions 669 


The general formula for J,*?(«, a’) can be derived, but it is so complex that 
it cannot be used conveniently. It is more convenient to use recurrence 
formulae, which enable us to reduce the integrals J,%"(«, «’) to the integral 
with s = p = 0. The formula 


—1 
Jea) = Ua, a) J,a) (£.15) 


enables us to reduce J,*?(«, «’) to the integral with p = 0. The formula 


4 
Ja, a’) = pa pa ARB) hath’! — B's) Jy (0,0 )+ 


+5(y—1+5—2a’) J,3-10(a, a’) + 20's] y-a, 1)} (£.16) 


then makes possible the final reduction to the integral with s = p = 0.+ 


t See W. Gordon, Annalen der Physik [5] 2, 1031, 1929. 
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total 202 
Orthogonal functions 10 
Orthohelium 258 n. 
Orthohydrogen 334, 643-4 
Orthonormal functions 10 
Oscillation theorem 60-1 
Oscillator 

in external field 149 

linear 67-74 

anharmonic 136 
strength 636 
three-dimensional 111 


Parabolic 

coordinates 129 

quantum numbers 131 
Parahelium 258 n. 
Parahydrogen 334, 643-4 
Parity 97 

addition rule for 101 

selection rules for 98 
Particle density operator 247 
Paschen — Back effect 467, 468 
Pauli matrices 204 
Pauli's principle 230 
Periodic system 271-9 
Permutations 234- 41 
Perturbation theory (VI) 133 ff. 
Perturbations 

adiabatic 148, 195-8 

in diatomic molecule 352-3 
Phase 

factor 7 

shift 109 

space 172, 261, 481 
Physical quantity 5 
Physical sheet 527 
Planck’s constant 20 
Plane wave, resolution of 112-14 
Point groups 362- 70, 378 — 82, 389 - 97 

continuous 370, 389 - 93 
Poisson bracket 27 n., 36 n. 
Polar vector 98 
Polarizability 285 
Polarization of particles, partial 221 - 3 
Polyatomic molecules (XIII) 398 ff. 
Positive terms 331, 425 
Potential barrier 79- 81 

in quasi-classical case 179 - 85, 191-5 
Potential scattering 562, 611 
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Representation (cont.) 
momentum 44, 535-8 


Potential wall 76-8, 170 
Potential well 63-7, 72-4, 110-11, 127-8, 


137-8, 160, 162-3, 549 
Predictable measurements 5 
Pre-dissociation 344-55 
Principal groups 274-5, 311 - 13 
Principal quantum number 119, 252, 485 
Principle of least action 20 
Probability current density 57 
Product 

of elements of group 360 

of groups 362 

of matrices 32 

of operators 14 

of quantities 13 

of representations of groups 376 

of spinors 209 
Pseudo-potential 641 
Pseudoscalar 98 
W-operators 246 
Pure states 39 n. 


Quadrupole moment 281 - 4, 496 
Quantum mechanics 2 
basic concepts of (I) 1 ff. 


and classical mechanics 2-3, 20-1, 51 - 2, 


54, 97, 160, (V11) 164 ff., 199 f., 227, 


456, 509, 521-6 
Quantum number 
azimuthal 104 
magnetic 104, 130 
parabolic 131 
principal 119, 252, 485 
radial 104, 119 
vibrational 319 
Quasi-classicality condition 165 
Quasi-classical systems 20, 48, 114, (VIÐ) 
164 ff., 521-6 
Quasi-discrete spectrum 559 
Quasi-stationary states 159, 559, 593, 608 


Racah coellicients 445 
Radial 

quantum number 104, 119 

wave function 103-5 

in Coulomb field 117 — 26 

in free motion 106-9 
Rainbow scattering 526 n. 
Ramsauer effect 547 n. 
Range, effective 555 
Reciprocity theorem for scattering 513, 

586. 606 

Reflection above the barrier 191-3 
Reflection amplitudes 78 
Reflection coefficient 77 

in quasi-classical case 191 — 3 
Regge poles 590 
Regge trajectories 590 
Representa1ion 

coordinate 44 

energy 33 n. 


of group 370-85 
antisymmetric product of 377 
basis of 371 
character of 372 
dimension of 199 n., 371 
direct product of 376 
equivalent 372 
irreducible 199 n., 372-85 
Kronecker product of 376 
physically irreducible 383 
reducible 372 
regular 375 
symmetric product of 377 
total vibrational 399 
two-valued 393-7 
unit 374 
of matrices 33 
of operators 37-8, 44 
of wave functions 18 
Schrödinger 37 
Resonance 152 

scattering 552 - 64, 574-9 
Retardation, effective 633-7 
Rigid body, rotation of 412-21 
Rotary-reflection axis 357 
Rotation of molecules 316- 21, 323 - 31, 

421-5 

Russell-Saunders coupling 270 
Rutherford’s formula 567 
Rydberg’s correction 256 


Saddle-point method 655 
Scattering (see also Elasti collisions; In- 
elastic collisions) 
amplitude 505, 526 - 38 
nuclear 578 
partial 507 

general theory of 504 ff. 

length 547 

in magnetic field 544-5 

matrix 512 ff., 603-7 

operator 512 

potential 562, 611 

quasi-classical 521 — 6 

rainbow 526 n. 

resonance 552 ~— 64, 574-9 
Schrödinger representation 37 
Schrédinger’s equation 51 ff. 

in central field 102-3 

for free particle 51 

in homogeneous field 74 

for linear oscillator 69 

In magnetic field 455-8 

in one dimension 60, 64 

in quasi-classical case 164 
Second quantization 241 — 50 
Secular equation 139 
Selection rules 93 - 4, 98, 385 —9, 442 
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Self-adjoint operator, see Self-conjugate 
operator 
Self-conjugate operator 12 
Self-consistent field 252, 257 -— 60, 483 
Shell model of the nucleus 482-91 
Sign of terms 331 - 2, 425 
Gj-symbols 444-9 
Slow particles 
elastic scattering of 545-59 
inelastic scattering of 601 -3 
S-matrix 512; see also Scattering matrix 
Spectral terms 252, 300 
Spectrum 
continuous 8, 15-19, 567-71 
discrete 8-15 
of eigenvalues 8 
Spherical Bessel functions 107 
Spherical harmonics 91, 656-9 
Spherical tensor 441 
Spherical top 413 
Spin (VIH) 199 ff. 
commutation relations 201 
components 200 
in magnetic field 470-2 
nuclear 485 
operator 204 
total 202 
variable 200 
Spin - axis interaction 321 
Spin — orbit interaction 267 — 70, 321, 484, 
583 —7 
Spin - spin interaction 267, 270, 326 
Spinors 206 ff. 
contraction of 209 
metric 209 
multiplication of 209 
svmmetrical 209 
and tensors 210-15 
unit 209 
Spur of matrix 36 
Stability of molecules 407 — 12 
Stark effect 284 -99 
in diatomic molecule 337-8 
linear 289 
Stationary states 28 
Statistical weight 333 n., 427 n., 607 
Sum of quanuities 13 
Summation theorem 634 
Superposition principle 7 
s-wave scattering 546 
Symmetric term 332 
Symmetrization 
of functions 235 
of spinors 209 - 12 
Symmetry 
axis of 356 
bilaicral 363 
equivalent 363 
centre of 357 
charge +474. 
groups 359 il. 


Index 


Symmetry (cont.) 

plane of 356 
equivalent 363 

of terms 
in diatomic molecule 300- 1, 331-4 
in polyatomic molecules 398 ff. 

theory of (XIE) 356 ff. 

transformations 356-9 


Tensor 
antisymmetric unit 84 
forces 479 


irreducible 214 

matrix elements of 441-4 

spherical 441 
Thomas — Fermi method 261 -6, 277-9 
3;-symbols 433 - 40 
Threshold of reactions 618 


Time reversal 24, 55, 207 n., 223 —6, 436 n., 


455 n., 512, 606 
Time-reversed states 606 
Top 412-21 

asymmetrical 415-18 
spherical 413 
symmetrical 414-15 
Trace of matrix 36 
Transition 
frequency 31 
probability, in quasi-classical case 191 -5 
Transmission coefficient 77 
in quasi-classical case 182-5 
Transport cross-section 519, 581 
Transposed operator 11 
Tunnel effect 179 
Turning points 165 


Uncertainty 

principle 2 
relations 47 

for energy 157 
Unitarity condition 512 
complex 590 
Unitary operator 36 


Valency 309-16 
Van der Waals forces 342 
Variational principle 58 — 60 
Vector addition coefficients 436 
Vector model 100 
Velocity, in quantum mechanics 4, 55- 6, 
457 i 
Vibrational 
angular momentum 422 
coordinates 399 
energy levels 405-7 
quantum number 319 
states and rotational states in diatomic 
molecule 316-21, 323-31 


Vibrations, molecular 
anharmonic 405 
classification of 398 - 405 
interaction with rotations 42)]-5 
Virtual level 554 


Wall, potential 76-8, 170 
Wave equation 26 
Wave function 6, 21-4, 57 ff. 
antisymmetrical 228 
of boson system 229 
coordinate 230 
of fermion system 229 
in magnetic field 457. 458-9 
near nucleus 266-7 
orbital 230 
quasi-classical 164 ff. 
radial 103-5, 106-9, 117 -26 
spin 230 
for arbitrary spin 210-12 
symmetrical 228 
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Wave mechanics 2 
Wave number 61, 106 
Wave packet 21, 47 
Well, potential, see Potential well 
Width 

of channel 610-11 

of level 159, 559. 610 
Wigner-Eckart theorem 442 
Wigner 3j-symbols 435 


X-ray terms 279-81 


Young diagrams 236 - 41 


Zeeman effect 464-70, 501 
anomalous 465 n 


